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Abstract 

Each complex network (or class of networks) presents specific topologi- 
cal features which characterize its connectivity and highly influence the dy- 
namics of processes executed on the network. The analysis, discrimination, 
and synthesis of complex networks therefore rely on the use of measurements 
capable of expressing the most relevant topological features. This article 
presents a survey of such measurements. It includes general considerations 
about complex network characterization, a brief review of the principal mod- 
els, and the presentation of the main existing measurements. Important re- 
lated issues covered in this work comprise the representation of the evolution 
of complex networks in terms of trajectories in several measurement spaces, 
the analysis of the correlations between some of the most traditional mea- 
surements, perturbation analysis, as well as the use of multivariate statistics 
for feature selection and network classification. Depending on the network 
and the analysis task one has in mind, a specific set of features may be cho- 
sen. It is hoped that the present survey will help the proper application and 
interpretation of measurements. 
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1 Introduction 



Complex networks research can be conceptualized as lying at the intersection be- 
tween graph theory and statistical mechanics, which endows it with a truly mul- 
tidisciplinary nature. While its origin can be traced back to the pioneering works 
on percolation and random graphs by Flory Q, Rapoport E1|3]IU, and Erdos and 
Renyi EJ|6jEll, research in complex networks became a focus of attention only 
recently. The main reason for this was the discovery that real networks have char- 
acteristics which are not explained by uniformly random connectivity. Instead, net- 
works derived from real data may involve community structure, power law degree 
distributions and hubs, among other structural features. Three particular develop- 
ments have contributed particularly for the ongoing related developments: Watts 
and Strogatz's investigation of small-world networks (H, Barabasi and Albert's 
characterization of scale-free models [9|, and Girvan and Newman's identification 
of the community structures present in many networks (e.g. [10 1). 

Although graph theory is a well-established and developed area in mathematics 
and theoretical computer science (e.g., fTTIfT2l '). many of the recent developments 
in complex networks have taken place in areas such as sociology (e.g., fljlHH ). 
biology (e.g., 1151 1 and physics (e.g., fl6l \Tf\ ). Current interest has focused not 
only on applying the developed concepts to many real data and situations, but also 
on studying the dynamical evolution of network topology. Supported by the avail- 
ability of high performance computers and large data collections, results like the 
discovery of the scale-free structure of the Internet [ 18 1 and of the WWW 17911201 
were of major importance for the increased interest on the new area of complex 
networks, whose growing relevance has been substantiated by a large number of 
recent related publications. Reviews of such developments have been presented in 
four excellent surveys l2ll l22l l23l l24ll . introductory papers l25l l26l l27l [T71l and 
several books G!|^[T3]|30l|3TJ[T6l|32). For additional information about the 
related areas of percolation, disordered systems and fractals see l33l l34l 1351 ; for 
complex systems, see I36ll37ll38ll . 

One of the main reasons behind complex networks popularity is their flexibility 
and generality for representing virtually any natural structure, including those un- 
dergoing dynamical changes of topology. As a matter of fact, every discrete struc- 
ture such as lists, trees, or even lattices, can be suitably represented as special cases 
of graphs. It is thus little surprising that several investigations in complex network 
involve the representation of the structure of interest as a network, followed by 
an analysis of the topological features of the obtained representation performed in 
terms of a set of informative measurements. Another interesting problem consists 
of measuring the structural properties of evolving networks in order to character- 
ize how the connectivity of the investigated structures changes along the process. 
Both such activities can be understood as directed to the topological characteri- 
zation of the studied structures. Another related application is to use the obtained 
measurements in order to identify different categories of structures, which is di- 
rectly related to the area of pattern recognition l39l l40ll . Even when modeling 
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networks, it is often necessary to compare the realizations of the model with real 
networks, which can be done in terms of the respective measurements. Provided 
the measurements are comprehensive (ideally the representation by the measure- 
ments should be one-to-one or invertible), the fact that the simulated networks yield 
measurements similar to those of the real counterparts supports the validity of the 
model. 

Particular attention has recently been focused on the relationship between the 
structure and dynamics of complex networks, an issue which has been covered in 
two excellent comprehensive reviews However, relatively little attention 

has been given to the also important subject of network measurements (e.g. BTl l. 
Indeed, it is only by obtaining informative quantitative features of the networks 
topology that they can be characterized and analyzed and, particularly, their struc- 
ture can be fully related with the respective dynamics. The quantitative description 
of the networks properties also provides fundamental subsidies for classifying theo- 
retical and real networks into major categories. The present survey main objective 
is to provide a comprehensive and accessible review of the main measurements 
which can be used in to quantify important properties of complex networks. 

Network measurements are therefore essential as a direct or subsidiary resource 
in many network investigations, including representation, characterization, classi- 
fication and modeling. Figure^shows the mapping of a generic complex network 
into the feature vector jl, i.e. a vector of related measurements such as average 
vertex degree, average clustering coefficient, the network diameter, and so on. In 
case the mapping is invertible, in the sense that the network can be recovered from 
the feature vector, the mapping is said to provide a representation of the network. 
An example of invertible mapping for networks with uniform vertices and edges 
is the adjacency matrix (see|2]>. Note, however, that the characterization and clas- 
sification of networks does not necessarily require invertible measurements. An 
interesting strategy which can be used to obtain additional information about the 
structure of complex networks involves applying a transformation to the original 
network and obtaining the measurements from the resulting network, as illustrated 
in Figure In this figure, a transformation T (in this case, deletion of the vertices 
adjacent to just one other vertex) is applied over the original network to obtain a 
transformed structure from which new measurements j2r are extracted. In case the 
feature vectors p and fir correspond to the same set of measurements, it is also 
possible to consider the difference between these two vectors in order to obtain 
additional information about the network under analysis. 

Perturbations of networks, which can be understood as a special case of the 
transformation framework outlined above, can also be used to investigate the sen- 
sitivity of the measurements. Informally speaking, if the measurements considered 
in the feature vector are such that small changes of the network topology (e.g., 
add/remove a few edges or vertices) imply large changes in the measurements 
(large values of ||A/2j|), those measurements can be considered as being highly 
sensitive or unstable. One example of such an unstable measurement is the average 
shortest path length between two vertices (see Section [T6.2t . 
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Figure 1: The mapping from a complex network into a feature vector. Generic 
mappings can be used in order to obtain the characterization of the network in 
terms of a suitable set of measurements. In case the mapping is invertible, we have 
a complete representation of the original structure. 




Figure 2: Additional measurements of a complex network can be obtained by ap- 
plying a transformation T on it and obtaining a new feature vector j!?. The dif- 
ference Aj2 between the original and transformed features vectors can also be con- 
sidered in order to obtain additional insights about the properties of the original 
network. 



5 



(a) (b) 

Figure 3: Given a network undergoing some dynamical evolution (a) and a set of 
measurements (e.g., p,\ and ^2), trajectories can be defined in the feature space (b). 



Another possibility to obtain a richer set of measurements involves the consid- 
eration of several instances along development/growth of the network. A feature 
vector jl(t) is obtained at each "time" instant t along the growth. Figure |3] shows 
four instances of an evolving network and the respective trajectory defined in one 
of the possible feature (or phase) spaces involving two generic measurements \x\ 
and H2- In such a way, the evolution of a network can now be investigated in terms 
of a trajectory in a features space. 

Both the characterization and classification of natural and human-made struc- 
tures using complex networks imply the same important question of how to choose 
the most appropriate measurements. While such a choice should reflect the specific 
interests and application, it is unfortunate that there is no mathematical procedure 
for identifying the best measurements. There is an unlimited set of topological 
measurements, and they are often correlated, implying redundancy. Statistical ap- 
proaches to decorrelation (e.g., principal component analysis and canonical analy- 
sis) can help select and enhance measurements (see Section [T8b. but are not guar- 
anteed to produce optimal results (e.g GU)- Ultimately, one has to rely on her/his 
knowledge of the problem and available measurements in order to select a suitable 
set of features to be considered. For such reasons, it is of paramount importance 
to have a good knowledge not only of the most representative measurements, but 
also of their respective properties and interpretation. Although a small number of 
topological measurements, namely the average vertex degree, clustering coefficient 
and average shortest path length, were typically considered for complex network 
characterization during the initial stages of this area, a series of new and more so- 
phisticated features have been proposed and used in the literature along the last 
years. Actually, the fast pace of developments and new results reported in this very 
dynamic area makes it particularly difficult to follow and to organize the existing 
measurements. 

This review starts by presenting the basic concepts and notation in complex 
networks and follows by presenting several topological measurements. Illustra- 
tions of some of these measurements respectively to Erdos-Renyi, Watts-Strogatz, 
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Figure 4: The four main types of complex networks and their transformations. 
All network types can be derived from the weighted digraph through appropriate 
transformations. 

Barabasi-Albert, modular and geographical models are also included. The mea- 
surements are presented in sections organized according to their main types, in- 
cluding distance-based measurements, clustering coefficients, assortativity, entropies, 
centrality, subgraphs, spectral analysis, community-based measurements, hierar- 
chical measurements, and fractal dimensions. A representative set of such mea- 
surements is applied to the five considered models and the results are presented 
and discussed in terms of their cross-correlations and trajectories. The important 
subjects of measurement selection and assignment of categories to given complex 
networks are then covered from the light of formal multivariate pattern recognition, 
including the illustration of such a possibility by using canonical projections and 
Bayesian decision theory. 

2 Basic Concepts 

Figure |4] shows the four main types of complex networks, which include weighted 
digraphs (directed graphs), unweighted digraphs, weighted graphs and unweighted 
graphs. The operation of symmetry can be used to transform a digraph into a graph, 
and the operation of thresholding can be applied to transform a weighted graph into 
its unweighted counterpart. These types of graphs and operations are defined more 
formally in the following, stalling from the concept of weighted digraph, from 
which all the other three types can be derived. 

A weighted directed graph, G, is defined by a set Af(G) of N vertices (or 
nodes), a set £(G) of M edges (or links), and a mapping oj : £{G) i— ► R. Each 
vertex can be identified by an integer value i = 1, 2, . . . , N; the edges are identified 
by a pair that represents a connection going from vertex i to vertex j to 
which a weight u>(i,j) is associated. In the complex network literature, it is often 
assumed that no self-connections or multiple connections exist; i.e. there are no 
edges of the form (£, i) and for each pair of edges («i, ji) and {12,32) it holds that 
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i\ ^ 12 or ji / ]2- Graphs with self- or duplicate connections are sometimes called 
multigraphs, or degenerate graphs. Only non-degenerate graphs are considered 
henceforth. In an unweighted digraph, the edges have no weight, and the mapping 
u is not needed. For undirected graphs (weighted or unweighted), the edges have 
no directions; the presence of a edge in £{G) thus means that a connection 
exist from i to j and from j to i. 

A weighted digraph can be completely represented in terms of its weight matrix 
W, so that each element Wij = uj(i,j) expresses the weight of the connection from 
vertex i to vertex j. The operation of thresholding can be applied to a weighted 
digraph to produce an unweighted counterpart. This operation, henceforth repre- 
sented as St(W), is applied to each element of the matrix W, yielding the matrix 
A = 5t(W). The elements of the matrix A are computed comparing the corre- 
sponding elements of W with a specified threshold T; in case \wij\ > T we have 
aij = 1, otherwise ay = 0. The resulting matrix A can be understood as the ad- 
jacency matrix of the unweighted digraph obtained as a result of the thresholding 
operation. Any weighted digraph can be transformed into a graph by using the 
symmetry operation a(W) = W + W T , where W T is the transpose of W. 

For undirected graphs, two vertices i and j are said to be adjacent or neighbors 
if ^ 0. For directed graphs, the corresponding concepts are those of prede- 
cessor and successor: if Oy ^ then i is a predecessor of j and j is a successor 
of i. The concept of adjacency can also be used in digraphs by considering pre- 
decessors and successors as adjacent vertices. The neighborhood of a vertex i, 
henceforth represented as v(i), corresponds to the set of vertices adjacent to i. 

The degree of a vertex i, hence ki, is the number of edges connected to that 
vertex, i.e. the cardinality of the set (in the physics literature, this quantity is 
often called "connectivity" J22)). For undirected networks it can be computed as 



The average degree of a network is the average of ki for all vertices in the network, 



In the case of directed networks, there are two kinds of degrees: the out-degree, 
k° nt , equal to the number of outgoing edges (i.e. the cardinality of the set of suc- 
cessors), and the in-degree, k\ n , corresponding to the number of incoming edges 
(i.e. the cardinality of the set of predecessors), 




(1) 




(2) 




(3) 




(4) 



j 



(5) 
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Note that in this case the total degree is defined as fcj = kf 1 + kf nt . The average 
in- and out-degrees are the same (the network is supposed isolated) 



For weighted networks, the definitions of degree given above can be used, but a 
quantity called strength of i, Sj, defined as the sum of the weights of the corre- 
sponding edges, is more generally used ll42l : 



In the general case, two vertices of a complex network are not adjacent. In fact, 
most of the networks of interest are sparse, in the sense that only a small fraction 
of all possible edges are present. Nevertheless, two non-adjacent vertices i and j 
can be connected through a sequence of m edges (i, k±), (k\, A^X • • • , (k m -i,j); 
such set of edges is called a walk between i and j, and m is the length of the walk. 
We say that two vertices are connected if there is at least one walk connecting 
them. Many measurements are based on the length of these connecting walks (see 
Section^}. A loop or cycle is defined as a walk starting and terminating in the same 
vertex i and passing only once through each vertex k n . In case all the vertices and 
edges along a walk are distinct, the walk is a path. 

In undirected graphs, if vertices i and j are connected and vertices j and k are 
connected, then i and k are also connected. This property can be used to partition 
the vertices of a graph in non-overlapping subsets of connected vertices. These 
subsets are called connected components or clusters. 

If a network has too few edges, i.e. the average connectivity of its vertices (k) 
is too small, there will be many isolated vertices and clusters with a small number 
of vertices. As more edges are added to the network, the small clusters are con- 
nected to larger clusters; after some critical value of the connectivity, most of the 
vertices are connected into a giant cluster, characterizing the percolation [33 1 of the 
network. For the Erdos-Renyi graph in the limit N — > oo this happens at {k} = 1 
[28 ]. Of special interest is the distribution of sizes of the clusters in the percolation 
point and the fraction of vertices in the giant cluster. The critical density of edges 
(as well as average and standard deviation) needed to achieve percolation can be 
used to characterize network models or experimental phenomena. 

Table ^lists the basic symbols used in the paper. 

3 Complex Network Models 

With the intent of studying the topological properties of real networks, several net- 
work models have been proposed. Some of these models have become subject of 
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Table 1 : List of basic symbols used in the text. 



Symbol Concept 



M(G) 


Set of vertices of graph G 


£{G) 


Set of edges of graph G 


\X\ 


Cardinality of set X 


N 


Number of vertices, \M{G) 


M 


Number of edges, £(G) 


W 


Weight matrix 




Element of the weight matrix 


A 


Adjacency matrix 


a ij 


Element of the adjacency matrix 


hi 


Degree of a vertex i 


fcout 


Out-degree of a vertex i 


k 


In-degree of a vertex i 


Si 


Strength of a vertex i 


1/(0 


Set of neighbors of vertex i 


||X|| 


Sum of the elements of matrix X 



great interest, including random graphs, the small-world model, the generalized 
random graph and Barabasi-Albert networks. Other models have been applied to 
the study of the topology of networks with some specific features, as geographi- 
cal networks and networks with community structure. We do not intend to cover 
a comprehensive a review of the various proposed models. Instead, the next sub- 
sections present some models used in the discussion on network measurements 
(Sections CUni and EE). 

3.1 The Random Graph of Erdos and Renyi 

The random graph developed by Rapoport El |3j El and independently by Erdos 
and Renyi 16] can be considered the most basic model of complex networks. 
In their 1959 paper 0, Erdos and Renyi introduced a model to generate random 
graphs consisting of N vertices and M edges. Starting with N disconnected ver- 
tices, the network is constructed by the addition of L edges at random, avoiding 
multiple and self connections. Another similar model defines N vertices and a 
probability p of connecting each pair of vertices. The latter model is widely known 
as Erdos-Renyi (ER) model. Figure|5la) shows an example of this type of network. 

For the ER model, in the large network size limit (N — » oo), the average 
number of connections of each vertex (k), given by 

(k) = p(N - 1), (9) 

diverges if p is fixed. Instead, p is chosen as a function of iV to keep (k) fixed: 
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(a) (b) 

Figure 5: The random graph of Erdos and Renyi: (a) an example of a random 
graph and (b) average degree distribution over 10 random networks formed by 
10 000 vertices using a probability p = 0.2. 

p = (k)/(N — 1). For this model, P(k) (the degree distribution, see Section^ is 
a Poisson distribution (see Figure|3b) and Tabled. 

3.2 The Small-World Model of Watts and Strogatz 

Many real world networks exhibit what is called the small world property, i.e. most 
vertices can be reached from the others through a small number of edges. This 
characteristic is found, for example, in social networks, where everyone in the 
world can be reached through a short chain of social acquaintances ll43ll44l. This 
concept originated from the famous experiment made by Milgram in 1967 l45ll . 
who found that two US citizens chosen at random were connected by an average 
of six acquaintances. 

Another property of many networks is the presence of a large number of loops 
of size three, i.e. if vertex i is connected to vertices j and k, there is a high prob- 
ability of vertices j and k being connected (the clustering coefficient, Section |5j 
is high); for example, in a friendship network, if B and C are friends of A, there 
is a high probability that B and C are also friends. ER networks have the small 
world property but a small average clustering coefficient; on the other hand, reg- 
ular networks with the second property are easy to construct, but they have large 
average distances. The most popular model of random networks with small world 
characteristics and an abundance of short loops was developed by Watts and Stro- 
gatz |8) and is called the Watts-Strogatz (WS) small-world model. They showed 
that small-world networks are common in a variety of realms ranging from the C. 
elegans neuronal system to power grids. This model is situated between an or- 
dered finite lattice and a random graph presenting the small world property and 
high clustering coefficient. 

To construct a small-word network, one starts with a regular lattice of N ver- 
tices (Figure |6j in which each vertex is connected to k nearest neighbors in each 
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Figure 6: The construction of a small- word network according to Watts and Stro- 
gatz: A regular network has its edges rewired with probability p. For p « the 
network is regular, with many triangles and large distances, for p 1, the network 
becomes a random network, with small distances and few triangles. 




(a) (b) 

Figure 7: The small- world model of Watts and Strogatz: (a) an example of a net- 
work with N = 64 vertices, k = 2, p = 0.1, and (b) average degree distribution 
over 10 WS networks with 10 000 vertices, k = 25 and p = 0.3. 

direction, totalizing 2k connections, where iV 3> K S> log(iV) 3> 1. Next, each 
edge is randomly rewired with probability p. When p = we have an ordered 
lattice with high number of loops but large distances and when p — ► 1, the network 
becomes a random graph with short distances but few loops. Watts and Strogatz 
have shown that, in an intermediate regime, both short distances and a large num- 
ber of loops are present. Figure Et a ) shows an example of a Watts-Strogatz net- 
work. Alternative procedures to generate small-world networks based on addition 
of edges instead of rewiring have been proposed l46l l47ll . but are not discussed 
here. 

The degree distribution for small-world networks is similar to that of random 
networks, with a peak at (k) = 2k (see also Tabled 



12 



3.3 Generalized Random Graphs 

A common way to study real networks is to compare their characteristics with 
the values expected for similar random networks. As the degrees of the vertices 
are important features of the network, it is interesting to make the comparison 
with networks with the same degree distribution. Models to generate networks 
with a given degree distribution, while being random in other aspects, have been 
proposed. 

Bender and Canfield l48ll first proposed a model to generate random graphs 
with a pre-defined degree distribution called configuration model. Later, Molloy 
and Reed [ 49 , 50 ] proposed a different method that produces multigraphs (i.e. loops 
and multiple edges between the same pair of vertices are allowed). 

The common method used to generate this kind of random graph involves se- 
lecting a degree sequence specified by a set {k{} of degrees of the vertices drawn 
from the desired distribution P(k). Afterwards, to each vertex i is associated a 
number ki of "stubs" or "spokes" (ends of edges emerging from a vertex) accord- 
ing to the desired degree sequence. Next, pairs of such stubs are selected uniformly 
and joined together to form an edge. When all stubs have been used up, a random 
graph that is a member of the ensemble of graphs with that degree sequence is 
obtained ED El El. 

Another possibility, the rewiring method, is to start with a network (possibly a 
real network under study) that already has the desired degree distribution, and then 
iteratively chose two edges and interchange the corresponding attached vertices 
1 54 1 . This rewiring procedure is used in some results presented in Section [T6.2l 

Due to its importance and amenability to analytical treatment, many works deal 
with this model, including the papers of Newman 123*1 . Aiello et al. "531 . Chung 
and Lu [56 1 and Cohen and Havlin [ 57 1. 

3.4 Scale-free Networks of Barabasi and Albert 

After Watts and Strogatz's model, Barabasi and Albert [9] showed that the degree 
distribution of many real systems is characterized by an uneven distribution. In- 
stead of the vertices of these networks having a random pattern of connections with 
a characteristic degree, as with the ER and WS models (see FigureEIa)), some ver- 
tices are highly connected while others have few connections, with the absence of 
a characteristic degree. More specifically, the degree distribution has been found 
to follow a power law for large k, 

P(k) ~ fc~ 7 (10) 

(see Figure Ob)). These networks are called scale-free networks. 

A characteristic of this kind of network is the existence of hubs, i.e. vertices 
that are linked to a significant fraction of the total number of edges of the network. 

The Barabasi- Albert (BA) network model is based on two basic rules: growth 
and preferential attachment. The network is generated starting with a set of mo 
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Figure 8: The scale-free network of Barabasi and Albert, (a) an example of a scale- 
free network and (b) average degree distribution over 10 Barabasi-Albert networks 
formed by 10, 000 vertices using m = 5. The degree distribution follows a power 
law, in contrast to that presented in Figure |5] 

vertices; afterwards, at each step of the construction the network grows with the 
addition of new vertices. For each new vertex, m new edges are inserted between 
the new vertex and some previous vertex. The vertices which receive the new edges 
are chosen following a linear preferential attachment rule, i.e. the probability of the 
new vertex i to connect with an existing vertex j is proportional to the degree of j, 



Thus, the most connected vertices have greater probability to receive new vertices. 
This is known as "the rich get richer" paradigm. 

Figure [SI a) shows an example of a Barabasi-Albert network. 

3.5 Networks with Community Structure 

Some real networks, such as social and biological networks, present modular struc- 
ture iflOl . These networks are formed by sets or communities of vertices such that 
most connections are found between vertices inside the same community, while 
connections between vertices of different communities are less common. A model 
to generate networks with this property was proposed by Girvan and Newman fTOl . 
This model is a kind of random graph constructed with different probabilities. Ini- 
tially, a set of N vertices is classified into c communities. At each following step, 
two vertices are selected and linked with probability p m , if they are in the same 
community, or p out , if they are in different communities. The values of p in and 
Pout should be chosen so as to generate networks with the desired shaipness in the 
distinction of the communities. When p out <C Pm, the communities can be easily 
identified. On the other hand, when p out ~ p m , the communities become blurred. 
Figure |3 a) presents a network generated by using the procedure above. 



V(i - j) 




(11) 
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Table 2: Analytical result of some basic measurements for the Erd6s-Renyi,Watts-Strogatz and Barabasi- Albert network models. 



Measurement 


Erdos-Renyi 


Watts-Strogatz 


Barabasi- Albert 


Degree 
distribution 


p(k) - *- { 7 )k 


p(k) - Y:r-? {k - K > K) ( K ) a P ) v-* ( jrC V e-*« 


P(k) ~ k~ 3 


Average 
vertex degree 


(k) = p(N - 1) 


(k) = 2k* 


(k) = 2m 


Clustering 
coefficient 


C = p 




C ~ iV" - 75 


Average 
path length 


In N 

1 ~ ln(fc) 


e(N, P )~p T f(N P T )* 


£ log(logjV) 



In WS networks, the value k represents the number of neighbors of each vertex in the initial regular network (in Figure[6] K = 4). 
The function f(u) = constant if u <C 1 or f(u) = \n(u)/u if u 3> 1. 



(a) (b) 

Figure 9: (a) An example of a random network with community structure formed 
by 64 vertices divides in 4 communities, (b) An example of geographical network 
formed by 64 vertices. 

3.6 Geographical Models 

Complex networks are generally considered as lying in an abstract space, where 
the position of vertices has no particular meaning. In the case of several kinds of 
networks, such as protein-protein interaction networks or networks of movie ac- 
tors, this consideration is reasonable. However, there are many networks where 
the position of vertices is particularly important as it influences the network evo- 
lution. This is the case of highway networks or the Internet, for example, where 
the position of cities and routers can be localized in a map and the edges between 
correspond to real physical entities, such as roads and optical fibers [58 1. This kind 
of networks is called geographical or spatial networks. Other important examples 
of geographical networks are power grids I59ll60ll . airport networks I6l"ll62ll63ll . 
subway I5H and neural networks [65]. 

In geographical networks, the existence of a direct connection between vertices 
can depend on a lot of constraints such as the distance between them, geographical 
accidents, available resources to construct the network, territorial limitation and 
so on. The models considered to represent these networks should consider these 
constraints. 

A simple way to generate geographical networks, used in the results described 
in SectionsEOEl and^D is to distribute N vertices at random in a two-dimensional 
space Q and link them with a given probability which decays with the distance, for 
instance 

Vii -» j) ~ e~ Xs ^ (12) 

where is the geographical distance of the vertices and A fixes the length scale 
of the edges. This model generates a Poisson degree distribution as observed for 
random graphs and can be used to model road networks (see Figure|9jb)). Alterna- 
tively, the network development might start with few nodes while new nodes and 
connections are added at each subsequent time step (spatial growth). Such a model 
is able to generate a wide range of network topologies including small-world and 
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linear scale-free networks [ 66 1. 



4 Measurements Related with Distance 

For undirected, unweighted graphs, the number of edges in a path connecting ver- 
tices i and j is called the length of the path. A geodesic path (or shortest path), 
between vertices i and j, is one of the paths connecting these vertices with mini- 
mum length (many geodesic paths may exist between two vertices); the length of 
the geodesic paths is the geodesic distance d^ between vertices % and j. If the 
graph is weighted, the same definition can be used, but generally one is interested 
in taking into account the edge weights. Two main possibilities include: first, the 
edge weights may be proportionally related to some physical distance, for example 
if the vertices correspond to cities and the weights to distances between these cities 
through given highways. In this case, one can compute the distance along a path 
as the sum of the weights of the edges in the path. Second, the edge weights may 
reflect the strength of connection between the vertices, for example if the vertices 
are Internet routers and the weights are the bandwidth of the edges, the distance 
corresponding to each edge can be taken as the reciprocal of the edge weight, and 
the path length is the sum of the reciprocal of the weight of the edges along the 
path. If there are no paths from vertex i to vertex j, then dij = oo. For digraphs, 
the same definitions can be used, but in general dij / dji, as the paths from vertex 
i to vertex j are different from the paths from j to i. 

Distance is an important characteristic that depends on the overall network 
structure. The following describes some measurements based on vertex distance. 



4.1 Average Distance 

We can define a network measurement by computing the mean value of dij , known 

as average geodesic distance: 

<-if(*rruS> (13) 

A problem with this definition is that it diverges if there are unconnected vertices 
in the network. To circumvent this problem, only connected pairs of vertices are 
included in the sum. This avoids the divergence, but introduces a distortion for 
networks with many unconnected pairs of vertices, which will show a small value 
of average distance, expected only for networks with a high number of connections. 
Latora and Marchiori l67l proposed a closely related measurement that they called 
global efficiency: 

where the sum takes all pairs of vertices into account. This measurement quantifies 
the efficiency of the network in sending information between vertices, assuming 
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that the efficiency for sending information between two vertices i and j is propor- 
tional to the reciprocal of their distance. The reciprocal of the global efficiency is 
the harmonic mean of the geodesic distances: 



As Eq. (TT31 does not present the divergence problem of Eq. (fT3l . it is therefore a 
more appropriate measurement for graphs with more than one connected compo- 
nent. 

The determination of shortest distances in a network is only possible with 
global information on the structure of the network. This information is not always 
available. When global information is unavailable, navigation in a network must 
happen using limited, local information and a specific algorithm. The effective dis- 
tance between two vertices is thus generally larger than the shortest distance, and 
dependent on the algorithm used for navigation as well as network structure 1681 . 



4.2 Vulnerability 

In infrastructure networks (like WWW, the Internet, energy supply, etc), it is im- 
portant to know which components (vertices or edges) are crucial to their best func- 
tioning. Intuitively, the critical vertices of a network are their hubs (vertices with 
higher degree), however there are situations in which they are not necessarily most 
vital for the performance of the system which the network underlies. For instance, 
all vertices of a network in the form of a binary tree have equal degree, therefore 
there is no hub, but disconnection of vertices closer to the root and the root itself 
have a greater impact than of those near the leaves. This suggests that networks 
have a hierarchical property, which means that the most crucial components are 
those in higher positions in the hierarchy. 

A way to find critical components of a network is by looking for the most 
vulnerable vertices. If we associate the performance of a network with its global 
efficiency, Eq. <fT4l> . the vulnerability of a vertex can be defined as the drop in 
performance when the vertex and all its edges are removed from the network [ 69 1 

E-Ei 

Vi = — ^ (16) 

where E is the global efficiency of the original network and E\ is the global 
efficiency after the removal of the vertex i and all its edges. As suggested by 
Gol'dshtein et al. [69], the ordered distribution of vertices with respect to their 
vulnerability Vi is related to the network hierarchy, thus the most vulnerable (criti- 
cal) vertex occupies the highest position in the network hierarchy. 

A measurement of network vulnerability [:70 1 is the maximum vulnerability for 
all of its vertices: 

V = maxVi. (17) 



18 



5 Clustering and Cycles 



A characteristic of the Erdos-Renyi model is that the local structure of the network 
near a vertex tends to be a tree. More precisely, the probability of loops involving 
a small number of vertices goes to in the large network size limit. This is in 
marked contrast with the profusion of short loops which appear in many real-world 
networks. Some measurements proposed to study the cyclic structure of networks 
and the tendency to form sets of tightly connected vertices are described in the 
following. 



5.1 Clustering Coefficients 

One way to characterize the presence of loops of order three is through the cluster- 
ing coefficient. 

Two different clustering coefficients are frequently used. The first, also known 
as transitivity ITU , is based on the following definition for undirected unweighted 
networks: 

where is the number of triangles in the network and N% is the number of 
connected triples. The factor three accounts for the fact that each triangle can be 
seen as consisting of three different connected triples, one with each of the vertices 
as central vertex, and assures that < C < 1. A triangle is a set of three vertices 
with edges between each pair of vertices; a connected triple is a set of three vertices 
where each vertex can be reached from each other (directly or indirectly), i.e. two 
vertices must be adjacent to another vertex (the central vertex). Therefore we have 

iV A = aija ik a jk , (19) 

k>j>i 

N 3 = ^2 (aija ik + ajia jk + a ki a kj ), (20) 

k>j>i 

where the are the elements of the adjacency matrix A and the sum is taken over 
all triples of distinct vertices i, j, and k only one time. 

It is also possible to define the clustering coefficient of a given vertex i [8 1 as: 

= (2.) 

where N&(i) is the number of triangles involving vertex i and N 3 (i) is the number 
of connected triples having i as the central vertex: 

#a(«) = ^aijaikajk, (22) 
k>j 

N 3 (i) = ^aijciik, (23) 

k>j 



19 



If ki is the number of neighbors of vertex i, then N^(i) = ki{ki — l)/2. N&(i) 
counts the number of edges between neighbors of i. Representing the number of 
edges between neighbors of i as li, Eq. d2TT) can be rewritten as: 

Using Ci, an alternative definition of the network clustering coefficient (differ- 
ent from that in Eq. (118b ) is 

£=^£Q- (25) 

i 

The difference between the two definitions is that the average in Eq. dTSb gives the 
same weight to each triangle in the network, while Eq. d25l gives the same weight 
to each vertex, resulting in different values because vertices of higher degree are 
possibly involved in a larger number of triangles than vertices of smaller degree. 

For weighted graphs, Barthelemy l42ll introduced the concept of weighted clus- 
tering coefficient of a vertex, 

n w 1 \- Wij + w ik 

Li = 771 _ i\ Z> o aijUikajk, (26) 

where the normalizing factor Si(ki — 1) (s, is the strength of the vertex, see Sec- 
tion 13) assures that < Cf < 1. From this equation, a possible definition of 
clustering coefficient for weighted networks is 

i 

Another definition for clustering in weighted networks f72ll is based on the inten- 
sity of the triangle subgraphs, (see Section [T2~2b . 

Cf = h{ ^_ 1} ^(uiijWjkWki) 1 / 3 , (28) 

where tbij = Wij/ maxjj Wij. 

Given the clustering coefficients of the vertices, the clustering coefficient can 
be expressed as a function of the degree of the vertices: 

C{k) = ^± C f ki \ (29) 

where 5ij is the Kronecker delta. For some networks, this function has the form 
C{k) ~ k~ a . This behavior has been associated with a hierarchical structure of the 
network, with the exponent a being called its hierarchical exponent E3- Soffer 
and Vazquez ll74l found that this dependence of the clustering coefficient with k 
is to some extent due to the degree correlations (Section |6j) of the networks, with 
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vertices of high degree connecting with vertices of low degree. They suggested a 
new definition of clustering coefficient without degree correlation bias: 




(30) 



where ij is the number of edges between neighbors of i and Ui is the maximum 
number of edges possible between the neighbors of vertex i, considering their ver- 
tex degrees and the fact that they are necessarily connected with vertex i. 

5.2 Cyclic Coefficient 

Kim and Kim [75 ] defined a cyclic coefficient in order to measure how cyclic a 
network is. The local cyclic coefficient of a vertex i is defined as the average of the 
inverse of the sizes of the smallest cycles formed by vertex i and its neighbors, 



where Sijk is the size of the smallest cycle which passes through vertices i, j and 
k. Note that if vertices j and k are connected, the smallest cycle is a triangle and 
Sijk = 3. If there is no loop passing through i, j and k, then these vertices are tree- 
like connected and S^k = oo. The cyclic coefficient of a network is the average of 
the cyclic coefficient of all its vertices: 



5.3 Rich-Club Coefficient 

In science, influential researchers of some areas tend to form collaborative groups 
and publish papers together IToTl . This tendency is observed in other real networks 
and reflect the tendency of hubs to be well connected with each other. This phe- 
nomenon, known as rich-club, can be measured by the rich-club coefficient, intro- 
duced by Zhou and Mondragon [77 1. The rich-club of degree k of a network G is 
the set of vertices with degree greater than k, 7L{k) = {v 6 J\f(G)\k v > k}. The 
rich-club coefficient of degree k is given by 



(the sum corresponds to two times the number of edges between vertices in the 
club). This measurement is similar to that defined before for the clustering coef- 
ficient (see Eq. i24i ). giving the fraction of existing connections among vertices 
with degree higher than k. 




(31) 




(32) 




(33) 
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Colizza et al. l76ll derived an analytical expression of the rich-club coefficient, 
valid for uncorrelated networks, 



(k)N' 



(34) 



The definition of the weighted rich-club coefficient for weighted networks is 
straightforward. If TV" (s) is the set of vertices with strength greater than s, TV" (s) = 

{v e G\s v > s}, 



(the sum in the numerator give two times the weight of the edges between elements 
of the rich-club, the sum in the denominator gives the total strength of the vertices 
in the club). 

6 Degree Distribution and Correlations 

The degree is an important characteristic of a vertex l!78l . Based on the degree of 
the vertices, it is possible to derive many measurements for the network. One of 
the simplest is the maximum degree: 



Additional information is provided by the degree distribution, P(k), which 
expresses the fraction of vertices in a network with degree k. An important property 
of many real world networks is their power law degree distribution |9j. For directed 
networks there are an out-degree distribution P out (k ont ), an in-degree distribution 
P m (k m ), and the joint in-degree and out-degree distribution P 10 (k m , k ont ). The 
latter distribution gives the probability of finding a vertex with in-degree kin and 
out-degree k ou t. Similar definitions considering the strength of the vertices can be 
used for weighted networks. An objective quantification of the level to which a log- 
log distribution of points approach a power law can be provided by the respective 
Pearson coefficient, which is henceforth called straightness and abbreviated as st. 

It is often interesting to check for correlations between the degrees of different 
vertices, which have been found to play an important role in many structural and 
dynamical network properties 1791 . The most natural approach is to consider the 
correlations between two vertices connected by an edge. This correlation can be 
expressed by the joint degree distribution P(k, k'), i.e. as the probability that an 
arbitrary edge connects a vertex of degree k to a vertex of degree k'. Another way 
to express the dependence between vertex degrees is in terms of the conditional 
probability that an arbitrary neighbor of a vertex of degree k has degree k' I801I8T1 . 




(35) 



max — 



max ki . 



(36) 



P{k'\k) 



(k)P(k,k') 
kP(k) 



(37) 
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Notice that Y^k' P{k'\k) = 1. For undirected networks, P(k, k') = P(k', k) and 
kl P(k\k')P(k') = kP(k'\k)P(k). For directed networks, k is the degree at the 
tail of the edge, k! is the degree at the head, both k and k! may be in-, out-, or total 
degrees, and in general P(k, k') 7^ P(k, k'). For weighted networks the strength s 
can be used instead of k. 

P(k, k') and P{k\k') characterize formally the vertex degree correlations, but 
they are difficult to evaluate experimentally, especially for fat-tailed distributions, 
as a consequence of the finite network size and the resulting small sample of ver- 
tices with high degree. This problem can be addressed by computing the average 
degree of the nearest neighbors of vertices with a given degree k ll82l . which is 
given by 



If there are no correlations, k nn (k) is independent of k, k nn (k) = (k 2 ) / (k). When 
k n n(k) is an increasing function of k, vertices of high degree tend to connect with 
vertices of high degree, and the network is classified as assortative, whereas when- 
ever k nn (k) is a decreasing function of k, vertices of high degree tend to connect 
with vertices of low degree, and the network is called disassortative fill. 

Another way to determine the degree correlation is by considering the Pearson 
correlation coefficient of the degrees at both ends of the edges [ 83 1 : 



where M is the total number of edges. If r > the network is assortative; if r < 0, 
the network is disassortative; for r = there are no correlation between vertex 
degrees. 

Degree correlations can be used to characterize networks and to validate the 
ability of network models to represent real network topologies. Newman [83] 
computed the Pearson correlation coefficient for some real and model networks 
and discovered that, although the models reproduce specific topological features 
such as the power law degree distribution or the small-world property, most of 
them (e.g., the Erdo-Renyi and Barabasi-Albert models) fail to reproduce the as- 
sortative mixing (r = for the Erdo-Renyi and Barabasi-Albert models). Further, 
it was found that the assortativity depends on the type of network. While social 
networks tend to be assortative, biological and technological networks are often 
disassortative. The latter property is undesirable for practical puiposes, because 
assortative networks are known to be resilient to simple target attack, at the least. 
So, for instance, in disease propagation, social networks would ideally be vulnera- 
ble (i.e. the network is dismantled into connected components, isolating the focus 
of disease) and technological and biological networks should be resilient against 
attacks. The degree correlations are related to the network evolution process and, 
therefore, should be taken into account in the development of new models as done, 




(38) 



k' 



M 2~^>i kikjdij ^ Yl,j>i \ {k% + kj)dij 



(39) 



r = 
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for instance, in the papers by Catanzaro et al. l84ll on social networks, Park and 
Newman [85) on the Internet, and Berg et al. [86] on protein interaction networks. 
Degree correlations also have strong influence on dynamical processes like insta- 
bility |87|, synchronization |88l|89) and spreading (80l |90] |3T). F ° r additional 
discussions about dynamical process as in networks see Ref. E4"l . 



7 Networks with Different Vertex Types 

Some networks include vertices of different types. For example, in a sociological 
network where the vertices are people and the edges are a social relation between 
two persons (e.g., friendship), one may be interested in answering questions like: 
how probable is a friendship relation between two persons of different economic 
classes? In this case, it is interesting to consider that the vertices are not homo- 
geneous, having different types. In the following, measurements associated with 
such kind of networks are discussed. 



7.1 Assortativity 

For networks with different types of vertices, a type mixing matrix E can be de- 
fined, with elements e st such that e s t is the number of edges connecting vertices 
of type s to vertices of type t (or the total strength of the edges connecting the two 
vertices of the given types, for weighted networks). It can be normalized as 

E 

= Pip (40) 

where \\X\\ (cardinality) represents the sum of all elements of matrix X. 

The probability of a vertex of type s having a neighbor of type t therefore is 

pten*>)(t\s) = e 1 . (41) 

Note that Y,t P {typc) {t\s) = 1. 

P^ c \s,t) and E can be used to quantify the tendency in the network of 
vertices of some type to connect to vertices of the same type, called assortativity. 
We can define an assortativity coefficient [92j|23l as: 

3 = S. *■"•>(.!.) -i. (42) 

v N T -1 

where Nt is the number of different vertex types in the network. It can be seen 
that < Q < 1, where Q = 1 for a perfectly assortative network (only edges 
between vertices of the same type) and Q = for random mixing. But each vertex 
type has the same weight in Q, regardless of the number of vertices of that type. 
An alternative definition that avoids this problem 19311 is: 

_ TrE-\\E 2 \\ 
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It is interesting to associate the vertex type to its degree. The Pearson corre- 
lation coefficient of vertex degrees, Eq. d39l . can be considered as an assortativity 
coefficient for this case. 



7.2 Bipartivity Degree 

A special case of disassortativity is that of bipartite networks. A network is called 
bipartite if its vertices can be separated into two sets such that edges exist only 
between vertices of different sets. It is a known fact that a network is bipartite if 
and only if it has no loops of odd length (e.g. [94]). Although some networks are 
bipartite by construction, others, like a network of sexual contacts, are only approx- 
imately bipartite. A way to quantify how much a network is bipartite is therefore 
needed. A possible measurement is based on the number of edges between vertices 
of the same subset in the best possible division [94], 

hl-^^, (44) 

where -d{i) maps a vertex i to its type and 5 is the Kronecker delta. The smallest 
value of b for all possible divisions is the bipartivity of the network. The problem 
with this measurement is that its computation is NP-complete, due to the necessity 
of evaluating b for the best possible division. A measurement that approximates b 
but is computationally easier was proposed in l94l . based on a process of marking 
the minimum possible number of edges as responsible for the creation of loops of 
odd length. 

Another approach is based on the subgraph centrality ll95l (Section^OJl. The 
subgraph centrality of the network, Eq. d84l . is divided in a part due to even closed 
walks and a part due to odd closed walks (a closed walk is a walk, possibly with 
repetition of vertices, ending on the starting vertex). As odd closed walks are not 
possible in bipartite networks, the fraction of the subgraph centrality of the network 
due to even closed walks can be used as the bipartivity degree [95 1: 

_ ggeven _ EjLl cosh A j 

P ~ SC ~ E f =ie ^ ' (45) 

where SC is the subgraph centrality of the network (Section ^OJ), SC even is the 
subgraph centrality due to the even closed walks and the Xj are the eigenvalues of 
the adjacency matrix of the network. 



8 Entropy 

Entropy is a key concept in thermodynamics, statistical mechanics [96 1 and infor- 
mation theory [97]. It has important physical implications related to the amount 
of "disorder" and information in a system 19$]. In information theory, entropy de- 
scribes how much randomness is present in a signal or random event l99ll . This 
concept can be usefully applied to complex networks. 
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8.1 Entropy of the Degree Distribution 



The entropy of the degree distribution provides an average measurement of the 
heterogeneity of the network, which can be defined as 

H = - ^ P (k) log P(k). (46) 

k 

The maximum value of entropy is obtained for a uniform degree distribution and 
the minimum value H m i n = is achieved whenever all vertices have the same 
degree 1 1 001 . Network entropy has been related to the robustness of networks, 
i.e. their resilience to attacks 11001 . and the contribution of vertices to the network 
entropy is correlated with lethality in protein interactions networks 1 10 1 B . 

Sole and Valverde [ 102] suggested the use of the remaining degree distribution 
to compute the entropy. The remaining degree of a vertex at one end of an edge is 
the number of edges connected to that vertex not counting the original edge. The 
remaining degree distribution can be computed as 

, W - <* +1 $* + 1 > . ,47) 

The entropy of the remaining degree is given by 

H* = -^2q(k)logq(k). (48) 

A; 



8.2 Search Information, Target Entropy and Road Entropy 

The structure of a complex network is related to its reliability and information 
propagation speed. The difficulty while searching information in the network can 
be quantified through the information entropy of the network [ 103II104I . Rosvall et 
al. 11051 introduced measurements to quantify the information associated to locat- 
ing a specific target in a network. Let p(i, b) be a shortest path starting at vertex i 
and ending at vertex b. The probability to follow this path in a random walk is 

vWM = j II r^r ( 49 > 

je P (i,b) 3 

where kj is the degree of vertex j and the product includes all vertices j in the path 
p(i, b) with the exclusion of i and b. The search information, corresponding to the 
total information needed to identify one of the shortest paths between i and b, is 
given by 

sM) = -i og2 PWM (50) 

{P(i,b)} 

where the sum is taken over all shortest paths p(i, b) from i to b. 
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The average search information characterizes the ease or difficulty of naviga- 
tion in a network and is given by 11051 

y 

This value depends on the structure of the network. As discussed by Rosvall et al. 
11051 . city networks are more difficult to be navigated than their random counter- 
parts. 

In order to measure how difficult it is to locate vertices in the network starting 
from a given vertex i, the access information is used, 

A = ^J>(i,6), (52) 

b 

which measures the average number of "questions" needed to locate another vertex 
starting from i. To quantify how difficult is to find the vertex b starting from the 
other vertices in the network, the hide information is used, 

?4 = ^ (53) 

% 

Note that the average value of A% and Tib for a network is S: Yli-Ai = 

J2 b n b = SN. 

Considering the exchange of messages in the network, it is possible to define 
entropies in order to quantify the predictability of the message flow. Assuming that 
messages always flow through shortest paths and all pairs of vertices exchange the 
same number of messages at the same rate, the following entropies can be defined 

una: 

% = -^2ajiCijlog 2 Cij, (54) 

ij 

TZi = -y^ajibijlog 2 bij, (55) 

ij 

where aji is an element of the adjacency matrix, Cjj is the fraction of messages 
targeted at vertex i that comes through vertex j, and bij is the fraction of messages 
that goes through vertex i coming from vertex j. In addition, % is the target entropy 
of vertex i and TZi is the road entropy of vertex i. Low values of these entropies 
mean that the vertex from where the next message originates {to vertex i or passing 
through vertex i) can be easily predicted. 

As a general measurement of the flows of messages, we can define target and 
road entropies for the network as averages among all vertices 



r = (56) 

i 

n = ^J>, (57) 
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As shown in 11031 . these quantities are related to the organization of the network: 
a network with a low value of T has a star structure and a low value of 1Z means 
that the network is composed by hubs connected in a string. 

Further works related to searchability in networks have been reported by Trusina 
et al. 11061 . who defined a search information weighted by the traffic on the net- 
work, and Rosvall et al. 11071 who studied networks with higher order organization 
like modular or hierarchical structure. 

9 Centrality Measurements 

In networks, the greater the number of paths in which a vertex or edge participates, 
the higher the importance of this vertex or edge for the network. Thus, assuming 
that the interactions follow the shortest paths between two vertices, it is possible to 
quantify the importance of a vertex or a edge in terms of its betweenness centrality 
fT08l defined as: 



where a(i, u,j) is the number of shortest paths between vertices i and j that pass 
through vertex or edge u, a(i,j) is the total number of shortest paths between i and 
j, and the sum is over all pairs i, j of distinct vertices. 

When one takes into account the fact that the shortest paths might not be known 
and instead a search algorithm is used for navigation (see Section I4.lt . the be- 
tweenness of a vertex or edge must be defined in terms of the probability of it be- 
ing visited by the search algorithm. This generalization, which was introduced by 
Arenas et al. 11091 . subsumes the betweenness centrality based on random walks 
as proposed by Newman II 101 . 

The central point dominance is defined as 11081 



where -B max is the largest value of betweenness centrality in the network. The 
central point dominance will be for a complete graph and 1 for a star graph in 
which there is a central vertex included in all paths. Other centrality measurements 
can be found in the interesting survey by Koschiitzki et al. II 1 1H . 

10 Spectral Measurements 

The spectrum of a network corresponds to the set of eigenvalues Aj (i = 1, 2, . . . , N) 
of its adjacency matrix A. The spectral density of the network is defined as 11121 




(58) 




(59) 



EH: 




(60) 
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where 5(x) is the Dirac delta function and p approaches a continuous function 
as N — > oo; e.g., for Erdos-Renyi networks, if p is constant as N — > oo, p(X) 
converges to a semicircle 11121 . Also, the eigenvalues can be used to compute the 
/th-moments, 

ii,i 2 ,...,ii i 

The eigenvalues and associated eigenvectors of a network are related to the 
diameter, the number of cycles and connectivity properties of the network 11121 
1 1 3 1 . The quantity Di = NMi is the number of paths returning to the same vertex 
in the graph passing through / edges. Note that these paths can contain already 
visited vertices. Because in a tree-like graph a return walk is only possible going 
back through the already visited edges, the presence of odd moments is a sure sign 
of cycles in the graph; in particular, as a walk can go through three edges and return 
to its starting vertex only by following three different edges (if self-connections are 
not allowed), D% is related with the number of triangles in the network 11131 . 

In addition, spectral analysis allows the identification whether a network is 
bipartite (if it does not contain any odd cycle J95), see section 17721 . characteriz- 
ing models of real networks 11114111151 . and visualizing networks 11161 . In ad- 
dition, spectral analysis of networks is important to determine communities and 
subgraphs, as discussed in the next section. 

11 Community Identification and Measurements 

Many real networks present an inhomogeneous connecting structure characterized 
by the presence of groups whose vertices are more densely interconnected one 
another than with the rest of the network. This modular structure has been found 
in many kinds of networks such as social networks 11171 II 18l . metabolic networks 
11191 and in the worldwide flight transportation network l62l . Figure fTOl presents 
a network with a well-defined community structure. 

Community identification in large networks is particularly useful because ver- 
tices belonging to the same community are more likely to share properties and 
dynamics. In addition, the number and characteristics of the existing communities 
provide subsidies for identifying the category of a network as well as understand- 
ing its dynamical evolution and organization. In the case of the World Wide Web, 
for instance, pages related to the same subject are typically organized into commu- 
nities, so that the identification of these communities can help the task of seeking 
for information. Similarly, in the case of the Internet, information about communi- 
ties formed by routers geographically close one another can be considered in order 
to improve the flow of data. 

Despite the importance of the concept of community, there is no consensus 
about its definition. An intuitive definition was proposed by Radichi et al. 11201 
based on the comparison of the edge density among vertices. Communities are 
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Figure 10: A network with community structure represented by the dashed lines. 
The communities are the groups of more intensely interconnected vertices. 



defined in a strong and a weak sense. In a strong sense, a subgraph is a community 
if all of its vertices have more connections between them than with the rest of the 
network. In a weak sense, on the other hand, a subgraph is a community if the sum 
of all vertex degrees inside the subgraph is larger than outside it. Though these def- 
initions are intuitive, one of their consequences is that every union of communities 
is also a community. To overcome this limitation a hierarchy among the commu- 
nities can be assumed a priori, as discussed by Reichardt and Bornholdt II 21 H . 
who defined community in networks as the spin configuration that minimizes the 
energy of the spin glass by mapping the community identification problem onto 
finding the ground state of a infinite range Potts spin glass 1122111231 . 

Another fundamental related problem concerns how to best divide a network 
into its constituent communities. In real networks, no information is generally 
available about the number of existing communities. In order to address this prob- 
lem, a measurement of the quality of a particular division of networks was proposed 
by Newman and Girvan I124L called modularity and typically represented by Q. 
If a particular network is split into c communities, Q can be calculated from the 
symmetric c x c mixing matrix E whose elements along the main diagonal, en, 
give the fraction of connections between vertices in the same community i while 
the other elements, (i ^ j) identify the fraction of connections between ver- 
tices in the different communities i and j. This is similar to the definition used to 
compute assortativity, Sectional The calculation of Q can then be performed as 
follows: 

Q = J>« - = TrE - \\E 2 \\, (62) 

» j 

The situation Q = 1 identifies networks formed by disconnected modules. This 
quantity has been used in many community-finding algorithms, as briefly reviewed 
in the following. 
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Though there are many ways to defined modularity, a generally accepted defi- 
nition of a module does not exist 11251 . The definitions described above estimate 
the modularity in terms of a given partitioning. Ziv et al. [126] proposed the modu- 
larity to be defined in terms of information entropy (see Section[8ll. This algorithm, 
which has been called the Network Information Bottleneck, tends to allow perfor- 
mance better than the algorithm based on betweenness centrality of Girvan and 
Newman. 

It should be noted that this review of community finding methods focus the 
subject of how specific network measurements have been adopted to identify the 
communities. Since we do not attempt to provide a comprehensive study of this 
important subject, the interested reader should refer to recent papers by Newman 
11271 and Danon et al. 11281 for further information and a more complete review 
of community finding methods. The following discussion has been organized into 
subsections according to the nature of the adopted methodology. 

11.1 Spectral Methods 

Spectral methods are based on the analysis of the eigenvectors of matrices derived 
from the networks 11291 . These methods have been discussed in a recent survey by 
Newman [ 1301 . The quantity measured corresponds to the eigenvalues of matrices 
associated with the adjacency matrix. These matrices can be the Laplacian matrix 
(also known as Kirchhoff matrix), 

L = D — A, (63) 

or the Normal matrix, 

A = D~ l A, (64) 

where D is the diagonal matrix of vertex degrees with elements da = J2j a ij > 

dij = for i ^ j. 

A particular method, called spectral bisection 1131 1 11321 1130B . is based on the 
diagonalization of the Laplacian matrix. If the network is separated into c discon- 
nected components, L will be block diagonal and have c degenerated eigenvectors, 
all corresponding to eigenvalue 0. However, if the separation is not clear, the diago- 
nalization of L will produce one eigenvector with eigenvalues and c — 1 eigenval- 
ues slightly different from 0. The spectral bisection considers the case when c = 2 
and the division of the network is obtained assigning positive components of the 
eigenvector associated with the second eigenvalue (the positive eigenvalue most 
close to 0) to one community and the negative ones to another community. Par- 
ticularly, the second eigenvalue, called algebraic connectivity, is a measurement 
of how good the division is, with small values corresponding to better divisions. 
Although spectral bisection is easy to implement, it tends to be a poor approach for 
detecting communities in real networks HI 301 - There are many alternative methods 
based on spectral analysis HI 33 1 . to be found in I127II1281 . 
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Recently, Newman 11341 proposed a method which reformulates the modu- 
larity concept in terms of the eigenvectors of a new characteristic matrix for the 
network, called modularity matrix. For each subgraph g, its modularity matrix 
has elements 



for vertices i and j in g. Thus, in order to split the network in communities, first the 
modularity matrix is constructed and its most positive eigenvalue and correspond- 
ing eigenvector are determined. According to the signs of the elements of this 
vector, the network is divided into two parts (vertices with positive elements are 
assigned to a community and vertices with negative elements to another). Next, the 
process is repeated recursively to each community until a split which makes zero 
or a negative contribution to the total modularity is reached. Following this idea, 
Newman proposed a new definition of communities as indivisible subgraphs, i.e. 
subgraphs whose division would increase the modularity. Currently, this method is 
believed to be the most precise, as it is able to find a division with the highest value 
of modularity for many networks 1 1 3411 - 

11.2 Divisive Methods 

In a divisive method, the underlying idea is to find the edges which connect differ- 
ent communities and remove them in a iterative form, breaking the network into 
disconnected groups of vertices. The computation of modularity can be used after- 
wards to determine the best division of the network. Next we give a brief descrip- 
tion of the most known divisive methods according to the adopted measurement 
used to choose the vertex to remove. 

11.2.1 Betweenness Centrality 

The most popular divisive method is the Girvan-Newman algorithm [10 j - Because 
different communities are connected by a small number of edges, this method 
considers that bottlenecks are formed at the edges which connect communities, 
through which all shortest paths should pass. In order to measure this traffic-related 
property in networks, the algorithm uses the concept of edge betweenness 1101 . see 
Section [9j Edges with high betweenness are progressively removed. After remov- 
ing each edge, the betweenness of each remaining edge must be calculated again. 

Although this algorithm represents a powerful alternative to determine com- 
munities (as shown in Figure [TTt. it has some disadvantages. The main one is its 
high computational cost. As discussed by Girvan and Newman flOl . the entire 
algorithm runs in worst-case time 0(M 2 N) on networks with M edges and N 
vertices. In order to overcome this limitation, some improvements in the algorithm 
were proposed including the Tyler's algorithm 11351 . which introduced a stochas- 




2M 
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tic element to the method, restricting the calculation of the betweenness only to a 
partial set of edges and using statistics to estimate the real betweenness. 



11.2.2 Edge Clustering Coefficient 

A different approach was proposed by Radicchi et al. 11201 (see also 11361 ). which 
is based on counting short loops of order / (triangles for I = 3) in networks. The al- 
gorithm is similar to Girvan and Newman's method, but instead of the betweenness 
centrality, it computes the edge clustering coefficient. This measurement is based 
on the fact that edges which connect communities tend to exhibit a small value for 
this coefficient. The clustering coefficient of edge is calculated as 



where Zy is the number of triangles to which belongs. This method can be 
generalized to more complex loops, e.g., squares. Though this method is simple 
and fast (0(M 4 /N 2 )), it fails whenever the network has a small average clustering 
coefficient, because the value of will be small for all edges. This suggests 
that the method will work well only when applied to networks with a high average 
clustering coefficient, such as social networks I128II . 

11.3 Agglomerative Methods 

Some networks are characterized by the fact that the vertices belonging to each 
community present similar features. So, it is in principle possible to obtain the 
communities by considering such similarities between vertices. In contrast to divi- 
sive methods, agglomerative approaches start with all vertices disconnected and 
then apply some similarity criterion to progressively join them and obtain the 
communities. It is interesting to note that this type of method presents a direct 
relationship with pattern recognition and clustering theory and algorithms (e.g., 
Ill37lll38lll39ll39ll ). which have been traditionally used in order to group individ- 
uals represented by a vector of features into meaningful clusters. 

11.3.1 Similarity Measurements 

One important family of agglomerative methods is known as hierarchical cluster- 
ing ll37U138ll39lfT3l . which starts with N vertices and no edges. Edges are added 
progressively to the network in decreasing order of similarity, starting with the pair 
with strongest similarity 1140111271 . To evaluate the similarity associated with edge 
a possibility is to use the so called Euclidian distance, given by 



Zi< + 1 



(66) 
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or the Pearson correlation between vertices as represented in the adjacency matrix, 
defined as 

PiO-j 

where m = ^ Ej a ij and of = ]v=i EjKi - Mi) 2 - 

Although this method is fast, the obtained division of the network is not gener- 
ally satisfactory for real networks, as discussed in 11271 . 



11.4 Maximization of the Modularity 

Newman 11241 11411 proposed a method based on joining communities in such 
a way as to maximize the modularity. In this method, two communities i and 
j are joined according to a measurement of affinity, given by the change of the 
modularity Q of the network (Eg. lo^t when the communities are joined 

AQij = 2 Uij j . (69) 

Thus, starting with each vertex disconnected and considering each of them 
as a community, we repeatedly join communities together in pairs, choosing at 
each step the joining that results in the greatest increase (or smallest decrease) in 
the modularity Q. This process can be repeated until the whole network is con- 
tained in one community. Currently, as discussed by Danon et al. 11281 . the New- 
man's method is believed to be the fastest one, running in 0{N log 2 N). Also, this 
method is more precise than the traditional method based on betweenness central- 
ity 11241 . However, as discussed by Danon et al. 11421 . the fast Newman's method 
has a limitation when the size of communities is not homogeneous, as a newly 
joined community i has the new values of in Eg. |69] increased, and tends to be 
chosen for new joining. In real networks the distribution of sizes of communities 
tends to follow a power law. So, this approach fails in many real networks. In order 
to overcome this limitation, it was proposed 11421 to normalize the value of A.Q 
by the number of edges in community i, 

AQ- - - — (e- - ^ (H ^ (ii ^ (70 ) 

This alteration on the local modularity makes the method more precise while 
not affecting its execution time. 



11.4.1 Extremal Optimization 

The extremal optimization method proposed by Duch and Arenas [ 143 ] is a heuris- 
tic search for optimizing the value of the modularity Q. The local modularity rep- 
resents the contribution of individual vertex i to the modularity Q. If q is the 
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Figure 11: Comparison of precision between the methods developed by Girvan 
and Newman [ 10j (based on removing edges with highest betweenness centrality 
value), the same method based on random walks 1144], and the method developed 
by Duch and Arenas, based on extremal optimization 11431 . Each point in this 
graph is an average of 100 realizations of networks with 128 vertices organized 
into 4 communities. 



community of vertex i, the local modularity is given by 



where are the elements of the community mixing matrix (page l30l and 5 is 
the Kronecker delta. In order to keep the value of this contribution in the interval 
[—1,1] and independent of vertex degree, it should be normalized by the degree of 
the vertex, i.e. qi = qi/ki. The value of qi is used as the, fitness for the extremal 
optimization algorithm. A heuristic search is performed to obtain the maximum 
value of the modularity. Initially, the network is split into two random partitions 
with the same number of vertices. After each step, the system self-organizes by 
moving the vertex with lowest fitness from one partition to another. The process 
stops when the maximum value of Q is reached. 

Although this method is not particularly fast, scaling as 0(N 2 logN), it can 
achieve high modularity values 11431 . By comparing the precision of some meth- 
ods as presented in Figure ^2 we can see that the extremal optimization method is 
more precise than the methods based on removing edges with highest betweenness 
centrality value. Moreover, it is clear that the computation of betweenness central- 
ity by counting the number of shortest paths passing through each edge is more 
precise than calculating this coefficient by random walks 11241 . 
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11.5 Local Methods 



More recently, some methods have been developed to detect the local community 
of a vertex based only on local information about the network topology. One such 
method was proposed by Bagrow and Bolt 11451 . which is based on the change of 
the hierarchical degree between two consecutive distances (see Section [T3b. Start- 
ing from a vertex vq, the vertices of successive hierarchical rings are added to the 
community, as long as the relation between the successive hierarchical degrees is 
greater than a specified threshold a 

> a. (72) 



When the expansion reaches a distance d for which the above condition fails, the 
community stops growing. 

Despite its favorable speed, this approach has an important limitation: the di- 
vision is precise only when vo is equidistant from all parts of its enclosing commu- 
nity's boundary 11461 . In order to overcome this drawback, it has been suggested 
11451 that the algorithm be executed N times starting from each vertex and then 
achieve a consensus about the detected communities. However, this approach in- 
creases the execution time of the algorithm. 

Another local method was proposed by Clauset 11461 which is based on com- 
puting the local modularity. The idea is that of a step-by-step growth of the com- 
munity together with the exploration of the network. The community C starts with 
only the original vertex vq. When a vertex is explored, a list of its neighbors is 
known. The set U is a list of all vertices that are not in C but are adjacent to some 
of its vertices; the set B (the boundary of C) is the subset of vertices in C that are 
adjacent of at least one vertex in U. The local modularity is defined as the ratio 
of the number of edges with one end point in B and neither end point in U to the 
number of edges with end points in B. Considering undirected networks, this can 
be written as 

R= ^ieB, j eC^ (?3) 

The algorithm consists in choosing iteratively from the set U the vertex that would 
result in the largest increase (or smallest decrease) in the value of R when added to 
C. The iteration stops when a pre-defined number of vertices was included in the 
community. 



11.6 Method Selection 

Despite the many interesting alternative methods, including those briefly reviewed 
above, it should be noted that the problem of community finding remains a chal- 
lenge because no single method is fast and sensitive enough to ensure ideal results 
for general, large networks, a problem which is compounded by the lack of a clear 
definition of communities. If communities are to be identified with high precision, 
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the spectral method proposed by Newman 11341 is a good choice. However, if pri- 
ority is assigned to speed, methods such as those using greedy algorithms (runs in 
0(N log 2 N)) should be considered 11411 . In brief, the choice of the best method 
to be used depends on the configuration of the problem and the kind of desired 
results f!28l 

One fact that should have become clear from our brief review of community 
finding approaches is the essential importance of the choice of the measurements 
adopted to express the separation of the communities. As a matter of fact, such 
measurements ultimately represent an objective definition of communities. There- 
fore, an interesting perspective for further research would be to consider the pos- 
sible adaptation and combination of some of the measurements reported in this 
survey with the specific objective of community characterization. 

11.7 Roles of Vertices 

After community identification, it is possible to determine the role of vertices 111 191 
by using the z-score of the within-module degree, z%, and the participation coeffi- 
cient, Pi. The z-score measures how "well-connected" vertex i is to the other 
vertices in the community, being defined by 



where is the number of connections i makes with other vertices in its own com- 
munity Si, q s . is the average of q over all vertex in Sj, and a qs . is the standard 
deviation of q in Sj. 

The participation coefficient measures how "well-distributed" the edges of ver- 
tex i are among different communities, 



where qi s is the number of edges from vertex i to community s and ki is the degree 
of vertex i. This value is zero if all edges are within its own community and 
it is close to one if its edges are uniformly distributed among all communities. 
Based on these two index, a zP parameter-space can be constructed, allowing the 
classification of vertices into different roles (see e.g., 11191 ). 

12 Subgraphs 

A graph g is a subgraph of the graph G if M(g) C J\f(G) and £{g) C £{G), with 
the edges in £{g) extending over vertices in N(g). If g contains all edges of G that 
connect vertices in M{g), the subgraph g is said to be implied by N{g). Impor- 
tant subgraphs include loops, trees (connected graphs without loops) and complete 
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(a) (b) (c) 



Figure 12: A network such as that in (a) includes several subgraphs, such as cycles 
(b) and trees (c). 



Figure 13: In a real network (a), the number of motifs (represented here by three 
vertices linked by dashed lines) is greater than in an equivalent random network 



subnetworks (cliques). Figure El shows a network and some subnetworks. The 
probability distribution of subgraphs in random graphs has been studied for some 
time [28 1, but interest has increased recently as a consequence of the discovery of 
network motifs as discussed below. 

12.1 Network Motifs 

Network motifs are subgraphs that appeal - more frequently in a real network than 
could be statistically expected 11471 11481 11491 (see Figure IT3l. Figure [141 shows 
some possible motifs of directed networks and their conventional names. To find 
the motifs in a real network, the number of occurrences of subgraphs in the net- 
work is compared with the expected number in the ensemble of networks with the 
same degree for each vertex. A large number of randomized networks from this 
ensemble is generated in order to compute the statistics of occurrence of each sub- 
graph of interest. If the probability of a given subgraph to appear at least the same 
number of times as in the real network is smaller than a given threshold (usually 
0.01), the subgraph is considered a motif of the network. 

In order to quantify the significance of a given motif, its Z-score can be com- 
puted. If Nf^' is the number of times that a motif i appears in the real network, 
(A^ rand ' ) ) the ensemble average of its number of occurrences, and (j( raild ) th e s tan- 
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Figure 14: Some types of motifs: (a) three-vertex feedback, (b) three chain, (c) 
feed-forward loop, (d) bi-parallel, (e) four-vertex feedback, (f) bi-fan, (g) feedback 
with two mutual dyads, (h) fully connected triad and (i) uplinked mutual dyad. 



dard deviation of the number of occurrences, then: 



N 



(real) 



(rand) 



(76) 



It is also possible to categorize different networks by the Z-scores of their 
motifs: networks that show emphasis on the same motifs can be considered as part 
of the same family 11501 . For this purpose, the significance profile of the network 
can be computed. The significance profile is a vector that, for each motif of interest 
i, is used to compute the importance of this motif with respect to other motifs in 
the network: 

SP i = v^W (77) 

It is interesting to note that motifs are related to network evolution. As de- 
scribed by Milo et al. 11481 . different kinds of networks present different types 
of motifs e.g., for transcription networks of Saccharomyces cerevisiae and Es- 
cherichia coli two main motifs are identified: feed-forward loop and bi-fan; for 
neurons: feed-forward loop, bi-fan and bi-parallel; for food-webs: three chain and 
bi-parallel; for electronic circuits: feed-forward loop, bi-fan, bi-parallel, four-node 
feedback and three node feedback loop; and for the WWW: feedback with two 
mutual dyads, fully connected triad and uplinked mutual dyad. Thus, motifs can 
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be considered as building blocks of complex networks and many papers have been 
published investigating the functions and evolution of motifs in networks (l5i . 



12.2 Subgraphs and Motifs in Weighted Networks 

In weighted networks, a subgraph may be present with different values for the 
weights of the edges. Onnela et al. 17211 suggested a definition for the intensity of 
a subgraph based on the geometric mean of its weights on the network. Given a 
subgraph g, its intensity is defined by 

l/n a 

1(g) = I via I , (78) 




where n g = \£(g) \ is the number of edges of subgraph g. 

In order to verify whether the intensity of a subgraph is small because all its 
edges have small weight values or just one of the weights is too small, the co- 
herence of the subgraph \f, defined as the ratio between geometric and arithmetic 
mean of its weights, can be used: 

*(9) = " (gK ■ (79) 

All possible subgraphs of the weighted graph can be categorized into sets of 
topologically equivalent subgraphs. 1 Let M be one such set of topologically equiv- 
alent subgraphs. The intensity of M is given by Im = J2 g eM ^0?) an< ^ coher- 
ence by ^ m = YlgeM ^(fiO- An intensity score ZIm can be accordingly defined 
by 

ZIm = M ~r< ] (80) 

(rand) 



Im 



and the coherence score, 



VI/ „, /rf/( rand )\ 

z* M = M ~r^ - i 

(rand 

where (I M a ) and crf^ nd ^ are the mean and the standard deviation of the inten- 
sities in a randomized graph ensemble; (^m ) and are the average and 
the standard deviation of the coherence in the randomized ensemble. When the 
network is transformed to its unweighted version, ZIm and Z^m tend to Z (see 
Eq. O). 



'Two subgraphs are topologically equivalent if their only difference is on the weight of the exist- 
ing edges. 
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12.3 Subgraph Centrality 



A way to quantify the centrality of a vertex based on the number of subgraphs in 
which the vertex takes part has been proposed 11511 . The respective measurement, 
called subgraph centrality, considers the number of subgraphs that constitute a 
closed walk starting and ending at a given vertex i, with higher weights given to 
smaller subgraphs. This measurement is related to the moments of the adjacency 
matrix, Eq. doTT) : 

where (A k )a is the ith diagonal element of the /cth power of the adjacency matrix 
A, and the factor k\ assures that the sum converges and that smaller subgraphs have 
more weight in the sum. Subgraph centrality can be easily computed 1 151] from 
the spectral decomposition of the adjacency matrix, 

N 

SC i = Y t v j (i) 2 e^, (83) 



where Xj is the jth eigenvalue and vj(i) is the ith element of the associated eigen- 
vector. This set of eigenvectors should be orthogonalized. The subgraph centrality 
of a graph is given by 1^51 : 

N N 



13 Hierarchical Measurements 

Using concepts of mathematical morphology H52lll53lll54l[T55l . it is possible to 
extend some of the traditional network measurements and develop new ones 11561 
I157M1581 . Two fundamental operations of mathematical morphology are dilation 
and erosion (see Figure [T5l. Given a subgraph g of a graph G, the complement of 
g, denoted g, is the subgraph implied by the set of vertices in G that are not in g, 

M{g)=N{G)\N(g) 

(\ is the operator of set difference). The dilation of g is the subgraph 5(g) implied 
by the vertices in g plus the vertices directly connected to a vertex in g. The erosion 
of g, denoted e(g), is denned as the complement of the dilation of the complement 
of g: 

These operations can be applied repeatedly to generate the d-dilations and d-erosions: 

5 d (g) = S(S(...(g)...)), (85) 
.. ' 
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(a) 



(b) 



Figure 15: Example of morphological operations: (a) Dilation: the dilation of 
the initial subnetwork (dark gray vertices) corresponds to the dark and light gray 
vertices; (b) Erosion: the erosion of the original subnetwork, given by the dark 
gray vertices in (a), results in the subnetwork represented by the black vertices in 
(b). 

e d {g) = e{e{...{g)...)) . (86) 

d 

The first operation converges to the entire network G and the second converges to 
an empty network. 

The d-ring of subgraph g, denoted Rd(g), is the subgraph implied by the set of 
vertices 

Af(5 d (g))\M(6 d - 1 (g)); 
the rs-ring of g, denoted R rs (g), is the subgraph implied by 

Af(S s (g))\M(S r ^(g)). 

Note that Rd{g) = Rdd{g)- The same definitions can be extended to a single ver- 
tex considering the subgraph implied by that vertex, and to an edge considering 
the subgraph formed by the edge and the two vertices that it connects. In the 
case of a single vertex i the abbreviations Rd(i) and R rs (i) are used. For ex- 
ample, in Figure^! i?i(15) includes the vertices {8, 14, 16, 17}; i?2(15) includes 
{1, 13, 18, 19}; for the graph g implied by the vertices {1, 15, 22} (in black), Ri(g) 
includes the vertices in white: {2, 3, 4, 5, 6, 7, 8, 9, 14, 16, 17}. 

The hierarchical degree of a subgraph g at distance d, henceforth represented as 
kd(g), can be defined as the number of edges connecting rings Rd{g) to Rd+i(g). 
Note that ko(i) is equal to hi. 

Another measurement which can be hierarchically extended is the clustering 
coefficient. The rs-clustering coefficient of g, C rs (g), can be defined as the num- 
ber of edges in the respective rs-ring n rs , divided by the total of possible edges 
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Figure 16: The subgraph of interest is defined by black vertices, g = {1, 15, 22}. 
The first hierarchical level of g is given by the first dilation around g, represented by 
the white vertices; the second hierarchical level is obtained dilating the subnetwork 
again, represented by the gray vertices. The hierarchical degree of the first level is 
given by the number of edges from white to light gray vertices, k\{g) = 12, and 
the hierarchical degree of the second level is the number of edges from light gray 
to dark gray vertices, foG?) = 2. 



between the vertices in that ring, i.e. for undirected networks 

r (n \ = 2n rs {g) 

Urs[9) \Af(R rs (g))\(\M(R rs (g))\-lY 16 ) 

Other possible hierarchical measurements are briefly described in the follow- 
ing. The convergence ratio at distance d of subgraph g, cvd(g), corresponds to the 
ratio between the hierarchical subgraph degree at distance d — 1 and the number 
of vertices in the ring at distance d; it can be understood as the average number of 
edges received by each vertex in the hierarchical level d from the previous level, 

c,,d(9) = wm)\- <88) 

It is also possible to define the divergence ratio, which corresponds to the recipro- 
cal of the convergence ratio 

dvM= m^i. (89) 

kd-i{9) 

The intra-ring degree is obtained by taking the average among the degrees of 
vertices in the subnetwork Rd(g)', note that only internal ring edges are consid- 
ered. On the other hand, the inter-ring degree is defined by the average number of 
connections between vertices in ring Rd(g) and those in Rd+i(g). 



14 Fractal Dimensionality 

Fractals are objects or quantities that display self-similarity (or self-affinity) in all 
scales. For complex networks, the concept of self-similarity under a length-scale 
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transformation was not expected because of the small world property, implying 
the average shortest path length of a network increases logarithmically with the 
number of vertices. However, Song et al. 11591 analyzed complex networks by 
using fractal methodologies and verified that real complex networks may consist 
of self-repeating patterns on all length scales. 

In order to measure the fractal dimension of complex networks, a box counting 
method and a cluster growing method has been proposed 11591 . In the former, the 
network is covered with Nb boxes, where all vertices in each of them are connected 
by a minimum distance smaller than Ib- Nb and Ib are found to be related by 

N B ~r B dB i (90) 

where ds is the fractal box dimension of the network. 

For the cluster growing method, a seed vertex is chosen at random and a cluster 
is formed by vertices distant at most I from the seed. This process is repeated many 
times and the average mass of resulting clusters is calculated as a function of I, 
resulting in the relation 

(M c )~l d f, (91) 

where the average mass (M c ) is defined as the number of vertices in the cluster and 
df is the fractal cluster dimension. 

For a network whose vertices have a typical number of connections, both ex- 
ponents are the same, but this is not the case for scale-free networks. 

Another scaling relation is found with a renormalization procedure based on 
the box counting method 11591 . A renormalized network is created with each box 
of the original network transformed into a vertex and two new vertices are con- 
nected if at least one edge exists between vertices of the corresponding boxes in 
the original network. By considering the degree k' of each vertex of the renormal- 
ized network versus the maximum degree k in each box of the original network we 
have that: 

k'^l B dk k, (92) 

The exponents 7 (of the power law of the degree distribution), ds and d^ are 
related by fT59ll : 

j = l + d B /d k . (93) 

Thus, scale-free networks, characterized by the exponent 7, can also be described 
by the two length invariant exponents ds and dk- 

15 Other measurements 

This section describes additional, complementary measurements related to network 
complexity, edge reciprocity and matching index. 
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15.1 Network Complexity 



It might be of interest to quantify the 'complexity' of a network. Lattices and other 
regular structures, as well as purely random graphs, should have small values of 
complexity. Some recent proposals are briefly presented below. 

Machta and Machta 11601 proposed the use of the computational complexity of 
a parallel algorithm 1161 H for the generation of a network as a complexity measure- 
ment of the network model. If there is a known parallel algorithm for the generation 
of the network of order 0(f(N)), with f(x) a given function, then the complex- 
ity of the network model is defined as 0(f(N)). For example, Barabasi-Albert 
networks can be generated in C(log log N) parallel steps 11601 . 

Meyer-Ortmanns 11621 associated the complexity of the network with the num- 
ber of topologically non-equivalent graphs generated by splitting vertices and parti- 
tioning the edges of the original vertex among the new vertices, the transformations 
being restricted by some constraints to guarantee the generation of valid graphs. 

The off-diagonal complexity, proposed by Claussen 11631 is defined as an en- 
tropy of a specially defined vertex- vertex edge correlation matrix. An element with 
indexes (k,l) of this matrix has contributions from all edges that connect a vertex 
of degree k to a vertex of degree I (only values k > I are used). 



15.2 Edge Reciprocity 

For directed networks, it is often interesting to know if their edges are reciprocal, 
i.e. if vertex i is linked to vertex j, is vertex j also linked to vertex i? Such infor- 
mation helps to obtain a better characterization of the network, can be used to test 
network models against real networks and gives indication of how much informa- 
tion is lost when the direction of the edges is discarded (e.g. for the computation 
of some measurements that only apply to undirected networks). 

A standard way to obtain information about reciprocity is to compute the frac- 
tion of bilateral edges: 

Q = 3 M , (94) 

where M is the total number of edges. 

A problem with this measurement is that its value is only relevant with respect 
to a random version of the network, as networks with higher connectivity tend 
to have a higher number of reciprocal edges due exclusively to random factors. 
Garlaschelli and Loffredo 11641 proposed the use of the correlation coefficient of 
the adjacency matrix: 

EijKj -^(aji -a) 

where a is the mean value of the elements of the adjacency matrix. This expression, 
known as edge reciprocity, simplifies to 

P = f4. (96) 
1 — a 
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This value is an absolute quantity, in the sense that values of p greater than zero im- 
ply larger reciprocity than the random version {reciprocal networks), while values 
below zero imply smaller reciprocity than a random network {antireciprocal net- 
works). This concept can be easily extended to weighted networks by substituting 
dij for Wij in the above expressions. 



15.3 Matching Index 

A matching index can be assigned to each edge in a network in order to quantify 
the similarity between the connectivity of the two vertices adjacent to that edge 
HI 65 1 . A low value of the matching index identifies an edge that connects two dis- 
similar regions of the network, thus possibly playing an important role as a shortcut 
between distant network regions II165II . The matching index of edge is com- 
puted as the number of matching connections of vertices i and j (i.e. connections 
to the same other vertex k), divided by the total number of connections of both 
vertices (excluding connections between % and j), 

Mij = gj^ff^ , (97) 

For directed networks, matching connections are only those in the same direction, 
and incoming and outgoing connections of vertices i and j should be considered 
separately. The matching index has also been adapted to apply to consider all the 
immediate neighbors of a node, instead of a single edge II166II . 



16 Measurements of Network Dynamics and Perturba- 
tion 

This section covers two important related issues, namely the use of trajectories to 
characterize the dynamical evolution of complex networks connectivity and a brief 
discussion about the sensitivity of measurements to perturbations. 



16.1 Trajectories 

As motivated in the Introduction of this work, in the following we analyze the 
behavior of trajectories (see Figure [T7b defined by tuples of measurements as the 
analyzed network undergoes progressive modification, such as during their growth. 
The network models considered for the illustration of trajectories include: Erdos- 
Renyi random graphs (ER), random networks with community structure (CN), 
Watts-Strogatz small-worlds (WS), Geographical Networks (GN), and Barabasi 
and Albert scale-free networks (BA) (see section|3]for a description of these mod- 
els). The number of vertices considered was 250, 500, 1000 and 2000, and the 
number of edges varies so that the average vertex degree ranges from 4 to 204, 
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increasing by steps of 20. In the case of the GN model, the vertices were ran- 
domly distributed through a square box of unit size. The A parameter in Eq. [O] 
was adjusted in order to guarantee the desired average degree. The CN networks 
include four communities interconnected by using p ou t/Pm (see Section |3j) equal 
to 5%, 10% and 15%. In the WS model, the probability p of rewiring the edges 
was 0.0002, 0.02 and 0.1. For the sake of better visualization, the trajectories of 
the WS and CN models are drawn separately from the other cases. The direction 
of evolution of the trajectories as more edges are included is indicated by arrows in 
Figures [n] These results are discussed subsequently with respect to several pairs 
of measurements. 

16.1.1 Average Clustering Coefficient and Average Shortest Path Length 

By inspecting the trajectories associated to this pair of measurements (see Fig- 
ure rrTT a)). two distinct behaviors can be identified. First, the average clustering 
coefficient C exhibits a high variation while the average shortest path length I 
remains almost constant with addition of edges for the ER, CN and BA models. 
Second, an opposite effect is observed for GN and WS models. In the latter case, 
the £ value undergoes a steep decrease, while staying almost constant for the other 
network models. This effect is related to the fact that GN and WS models are 
formed by vertices that tend to link to closer neighbors. Hence, with the addition 
of edges some long-range connections may decrease £ while C remains almost un- 
changed. Furthermore, t decreases faster for WS model than for GN while C for 
the former model remains larger than in the other cases. This can be explained by 
the fact that the WS model is more regular than the GN and has larger C. 

In the case of ER, CN and BA models, the values of I and C are smaller than 
for the other models. For I the connections are not limited by proximity, adjacency 
or geography, and for C loops of order three appear when new edges are added to 
them, increasing this measurement. 

Another interesting fact observed from Figure [FTT a) is that all curves converge 
to the same point, corresponding to fully connected graphs, as the networks become 
denser. Therefore, I and C tend in this stage to a unit value. 

16.1.2 Average Clustering Coefficient and Average Hierarchical Clustering 
Coefficient of Second Level 

The combination 2 of C and the average hierarchical clustering coefficient of sec- 
ond level (C2(i)), where the average is taken over all vertices in the network, see 
Figure [TTT b). tend to follow a power law for all trajectories except for the GN and 
WS models, whose curves have a minimum value for ((^(i)). Nonetheless, the 
highest growth rate is observed for the trajectories of WS model after their mini- 
mum value of (C2{i)) is reached. 

2 Note that C is the same as the average hierarchical clustering coefficient of first level. 
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Figure 17: Trajectories defined by pairs of measurements. Each point corresponds to 10 network model realizations. Network sizes used 
are 250, 500, 1000, and 2000; average degrees vary from 4 to 204 in steps of 20; for the community model, p ut/Pm is 5%, 10%, and 
15%; for the WS model the rewiring probability values are 0.0002, 0.02, and 0.1. 



Another interesting characteristic of this combination of measurements is that 
C is greater than (C2(i)). This can be explained by the fact of (C2{i)) is related 
to the presence of loops of order five without additional connections between their 
vertices I158H . Since loops of higher orders are less likely to appeal - in the consid- 
ered networks, C is larger than (62(2)). 

16.1.3 Pearson Correlation Coefficient and Central Point Dominance 

For all network models, except for the GN, the r value is close to zero even with 
the addition of new edges, as can be seen in Figure [T7l c). which shows the tra- 
jectories defined by the pair of measurements Pearson correlation of the degrees 
r (Section |6j and central point dominance CPD (Section |9j) as the average de- 
gree increases. This property can be explained by the fact that in ER, CN and WS 
models edges are placed without regard to vertex degree, while the BA model is 
based on growth l%3l . which leads to non-assortative mixing (i.e. no correlation 
between vertex degrees). The r value for the GN model is greater than zero in 
almost all cases because its growing dynamics is based on the geographic proxim- 
ity of vertices. As the position of vertices is randomly chosen, some regions may 
result highly populated, implying the respective vertices to have a high probability 
of becoming highly interconnected. On the other hand, vertices belonging to the 
regions barely populated have a small likelihood to become "hubs" while still hav- 
ing a good chance of being connected. These two opposite behaviors lead to a r 
value greater than zero. 

The central point dominance is a measurement of the maximum betweenness 
of any point in the network 11081 (see Section[9]for further details). By observing 
Figure [T7T c). one can see that most network models exhibit average values of this 
measurement close to zero, except for the BA, GN and WS cases. In BA networks, 
values significantly larger than zero only occur in the beginning of the growth 
process (i.e. with few edges). 

For WS models, the way in which they are normally constructed (see Section^ 
directly contributes to producing a network with modular structure, hence a high 
CPD value. Nevertheless, when new edges are added, the network gets denser and 
the value of this measurement goes to zero. In CN models, the CPD coefficient 
depends on the relation between the average vertex degrees inside and outside com- 
munities, i.e. when the network is highly modular, the CPD value tends to become 
larger. 

16.1.4 Average Hierarchical Degree of Second Level and Average Hierarchi- 
cal Divergence Ratio of Third Level 

As shown in Figure [PTT d). all curves obtained for the average hierarchical degree 
of second level 3 (&2 W) an d the average hierarchical divergence ratio of level three 

3 Notice that (A-2 (i)) (average taken over all vertices i in the network) depends on the network 
connectivity. 
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(dv^ii)} have similar behavior. When the networks are sparse and new edges are 
added to them increasing the average vertex degree, the average hierarchical coun- 
terpart increases until a maximum value. Afterwards, since the networks have a 
finite size, further increase of the connectivity tends to reduce the number of hier- 
archical levels in the networks and, as consequence, the average hierarchical vertex 
degree of levels higher than one tends to decrease. The hierarchical divergence ra- 
tio of level three decreases with larger average vertex degree. 

16.1.5 Discussion 

As presented in Figure El ea °h measurement is specifically sensitive to the effects 
of addition of new edges to a network. Interestingly, the sensitivity also depend 
strongly on the network model. Some trajectories resulted closer one another for 
specific network models as a consequence of inherent structural similarities. This 
effect is particularly pronounced in trajectories defined by the average clustering 
coefficient and average shortest path length, where two classes of trajectories ap- 
pear, one for ER, BA and CN, and another for GN and WS. 

The analysis of network dynamics provides insights about model similarities. 
If network trajectories evolve in a similar fashion, it is possible to infer that these 
networks have similar structure concerning the respective pair of measurements. 
However, for other measurements, this similarity may be weaker or non-existent. 
For instance, in the space defined by the average clustering coefficient and average 
shortest path length, the curves obtained for RA and BA evolve in similar fash- 
ion. This behavior is not observed in the space defined by the central point dom- 
inance and Pearson correlation coefficient. Also, by inspecting the trajectories, it 
is possible to determine the correlation between measurements during the network 
evolution. For instance, the dynamics of the average clustering coefficient and the 
average hierarchical clustering coefficient of second level present correlation for 
ER and CN. 

The trajectory-based study described here can be immediately extended to real 
network analysis and modeling. In the case of the WWW, for instance, by in- 
specting its evolution in the measurements space it is possible to develop more 
precise models to represent and characterize its structure. For citation networks, 
it is possible to characterize the networks generated for different knowledge areas 
and obtain insights about their structure and evolution. All in all, trajectories pro- 
vide a visually clear and accessible interpretation and dynamics about the evolution 
of complex networks connectivity. 

16.2 Perturbation Analysis 

Another important property of a given measurement relates to how much it changes 
when the networks undergo small perturbations (e.g., rewiring, edge or vertex at- 
tack, weight changes, etc.). For instance, the shortest path length provides a good 
example of a particularly sensitive measurement, in the sense that the modification 
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Figure 18: Example of perturbations. Each point conesponds to 100 realizations 
of networks with N = 1000 and (k) = 6. 

of a single connection may have great impact in its value. The quantification of the 
sensitivity of measurements to different types of perturbations and networks there- 
fore provides valuable information to be considered for characterization, analysis 
and classification of complex networks. 

Interesting insights can be obtained as far as this subject is concerned by per- 
forming progressive perturbations in a specific network and observing the respec- 
tive relative variations in the measurement of interest. Figure ED shows the trajec- 
tories obtained in the {C, £} and {C, CPD} feature spaces shown in Figure IX8V a) 
and (b), respectively, while considering edge randomization 15311 (see also Sec- 
tion l3.3t progressively performed for the BA, ER, GN and WS models (N = 1000, 
(k) = 6). The values shown in this figure were normalized through division by the 
respective averages of the measurements in order to provide suitable visual com- 
parison. 

Each successive point along the trajectories, which are indicated by arrows in 
Figure was obtained after a number of rewirings which are successive inte- 
ger powers of two (for the sake of obtaining more uniform visualization). It is 
clear from the results that the sensitivity of the two pairs of measurements varies 
substantially with respect to the type of network under consideration. More specifi- 
cally, much wider variations were observed for the GN and WS models. The stable 
trajectories obtained for the shortest path length in the case of the BA model is 
a direct consequence of the fact that these networks are inherently characterized 
by low overall average shortest path length. A similar situation is verified for the 
clustering coefficient, which tends to be small in those two types of networks. The 
evolution of the trajectory regarding the clustering coefficient and average shortest 
path length for the GN and WS networks is a direct consequence of the fact that 
the progressive edge rewiring tend to strongly reduce those two measurements. 
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The marked difference of sensitivity of measurements to perturbations depending 
on the type of network model suggests that quantifications of the sensitivity (e.g. 
the standard deviation or entropy) can be potentially useful as measurements for 
network identification. 

17 Correlations Analysis 

In order to investigate possible linear relationships between several of the measure- 
ments considered, we calculated them for the BA, ER and GN models (N = 1000 
and {k} = 4) and estimated the respective pairwise correlations. Table |3] shows 
the values obtained with respect to each model and also considering all models 
together ('All'). 

Several interesting facts can be inferred from this table. First, particularly high 
absolute values of correlations have been obtained for the BA model, with low ab- 
solute values observed for the ER and GN cases. This seems to represent a partic- 
ularly interesting property of the BA networks. Another interesting finding regards 
the fact that the correlations obtained for specific network models not necessarily 
agree with that obtained when the three models are considered together. This is the 
case, for instance, of the low correlation observed between the measurement aver- 
age shortest path length £ and log-log degree distribution straightness st for each 
of the three individual models and high correlation otherwise obtained when these 
three models are considered jointly. This interesting behavior can be immediately 
understood from Figure [^J which indicates that the three low correlations groups 
obtained for the individual models tend to align globally, therefore implying the 
relatively strong negative correlation. Such situations indicate that the individual 
and global correlations provide information about different types of relationships 
and should be treated accordingly. 

It is also clear from the results in Table|3]that particularly high correlations were 
obtained between the average shortest path length i and the vertex degree at the 
second hierarchical level (^(i)). This fact suggests that this specific hierarchical 
vertex degree may be considered, at least for the three types of networks, as an 
estimation of the average shortest path length, allowing substantial computational 
saving. Another interesting result is that the highest correlations were obtained 
for the BA model, a possible consequence of the respective hubs. For instance, 
the correlation between the average shortest path length and the average clustering 
coefficient was equal to -0.63 for the BA models. This is a consequence of the fact 
that additional links tend to be established with the hubs and therefore contribute 
to higher clustering and shortest paths. 
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Table 3: Correlation between measurements for the BA, ER and GN models and 



"All" jointly. Values estimated from 1000 realizations (for each model) of networks 
with N = 1000 and {k) = 4. 







st 


r 


C 


i 


CPD 


(M*)> 


<C 2 (i)> 


(dv 3 (i)) 




BA 


1.00 
















St 


ER 


1.00 


















GN 


1.00 


















All 


1.00 


















BA 


-0.22 


1.00 














r 


ER 


-0.01 


1.00 
















GN 


-0.13 


1.00 
















All 


0.71 


1.00 
















BA 


0.06 


-0.29 


1.00 












c 


ER 


-0.01 


0.07 


1.00 














GN 


0.04 


-0.00 


1.00 














All 


0.31 


0.82 


1.00 














BA 


-0.01 


0.38 


-0.63 


1.00 










I 


ER 


-0.06 


0.04 


-0.08 


1.00 












GN 


-0.10 


0.02 


0.03 


1.00 












All 


0.69 


0.96 


0.88 


1.00 












BA 


-0.09 


0.23 


0.39 


-0.58 


1.00 








CPD 


ER 


-0.61 


0.10 


0.03 


0.07 


1.00 










GN 


-0.05 


-0.02 


0.03 


0.23 


1.00 










All 


-0.87 


-0.44 


0.02 


-0.41 


1.00 










BA 


0.01 


-0.30 


0.63 


-0.99 


0.60 


1.00 






(fe(i)> 


ER 


0.04 


0.03 


0.08 


-0.90 


-0.06 


1.00 








GN 


0.08 


0.28 


-0.02 


-0.65 


-0.13 


1.00 








All 


-0.96 


-0.80 


-0.43 


-0.79 


0.85 


1.00 








BA 


0.02 


0.02 


0.58 


-0.74 


0.59 


0.76 


1.00 






ER 


-0.03 


0.04 


0.45 


-0.16 


0.02 


0.19 


1.00 






GN 


-0.00 


0.09 


0.59 


0.18 


0.07 


-0.11 


1.00 






All 


0.37 


0.86 


0.99 


0.91 


-0.05 


-0.49 


1.00 






BA 


0.01 


0.26 


-0.57 


0.91 


-0.52 


-0.94 


-0.69 


1.00 


(*s(0> 


ER 


0.03 


-0.10 


-0.01 


-0.25 


-0.01 


-0.16 


-0.04 


1.00 




GN 


-0.02 


-0.28 


-0.09 


-0.03 


-0.00 


-0.50 


-0.21 


1.00 




All 


-0.14 


-0.74 


-0.97 


-0.79 


-0.18 


0.27 


-0.96 


1.00 



53 




-1.00 -0.95 -0.90 -0.85 -0.80 -0.75 

log-log degree density straightness 



Figure 19: Example of scatterplot showing the low correlation between log-log 
degree density straightness and the average shortest path length for all the individ- 
ual models and the high correlation for all models together. The networks have 
N = 1000 and (k) = 4; 500 realizations of each model were used. 

18 Multivariate Statistical Methods for Measurement Se- 
lection and Network Classification 

The intrinsic statistical variability of the connectivity of real and simulated com- 
plex networks, even when produced by the same process or belonging to the same 
class, implies that sound characterization, comparison and classification of net- 
works should take into account not only the average measurements, but also ad- 
ditional information about their variability including higher statistical moments 
(e.g., variance, kurtosis, etc.) as well as multivariate statistical distribution of the 
measurements. For example, realizations obtained by using the Barabasi-Albert 
(BA) model with fixed parameters will produce networks which, though not iden- 
tical, will have equivalent statistical distribution of their properties. Figures |2U] 
shows a scatterplot obtained by considering 1000 realizations of the BA model 
with N = 1000 and m = 3 with respect to the measurements (r, C, £), where r is 
the Pearson correlation coefficient of vertex degrees; C, the average clustering co- 
efficient; and t, the average shortest path length. Although the obtained points form 
a well-defined cluster around the average point (—0.0653, 0.0365, 3.255), there is 
a significant dispersion of cases around this center, implying that additional mea- 
surements other than the mean need to be used for proper characterization of the 
network under analysis. Therefore, any objective attempt at characterizing, com- 
paring or classifying complex networks needs to take into account distributions in 
phase spaces such as that in Figure |20| Such an important task can be effectively 
accomplished by using traditional and well-established concepts and methods from 
Multivariate Statistics (e.g., l39l l40l .McLachlan :04) and Pattern Recognition (e.g., 
!39lfl67ll40T). 
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Figure 20: The spatial distribution in the (r,C,£) phase space of 1000 different 
realizations of the BA model with N = 1000 and m = 3. The distribution has 
been projected into the three main planes (gray shadows) for the purpose of better 
visualization. The white circles in the middle of the gray shadows represent the 
mean projected into these planes. 

As far as the choice and interpretation of network measurements are concerned, 
two multivariate methods stand out as being particularly useful, namely Principal 
Component Analysis — PCA (e.g. 13911401 1 and Canonical Variable Analysis (e.g., 
[ 168 1). While the former procedure allows the reduction of the dimensionality of 
the measurement phase space, obtained in terms of projections so as to concentrate 
the variation of the data along the first axes (i.e. those associated to the highest 
covariance matrix eigenvalues), the second method implements such projections 
so as to achieve best separation, in terms of inter and intra-class distances (see 
below), between the involved classes of networks under analysis. 

Another situation in multivariate statistics which is particularly important for 
complex network research concerns network identification. Indeed, it is often a 
critical issue to decide to which of several models a given theoretical or experi- 
mentally obtained network belongs. This important problem can be approached in 
a sound way by using Bayesian decision theory l40l . a well-established method- 
ology which, provided we have good probabilistic models of the properties of the 
networks, allows near-optimal classification performance 4 . 

The current section presents and illustrates in a self-contained and accessible 
fashion these two families of methods from multivariate statistics, i.e. dimension- 
ality reduction (PCA and canonical analysis) and classification (Bayesian decision 
theory). The possibility to apply Bayesian decision theory on phase spaces ob- 

4 Optimal performance is guaranteed provided the involved mass and conditional properties are 
perfectly known (see Section fTOl and 13911401 ). 
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tained through canonical variable analysis projections of multidimensional mea- 
surement spaces is also illustrated. The potential for applications of these methods 
is illustrated with respect to three reference complex network models — namely 
Erdos and Renyi random graph (ER), Barabasi- Albert (BA) and Geographical Net- 
work model (GN), against which some real-world networks are classified. 

It should be observed that many other methods from multivariate statistical 
analysis, including hierarchical clustering and structural equation modeling, can 
also be valuable for investigations in complex network research. Though the po- 
tential of hierarchical clustering for suggesting relationships between the classes is 
briefly illustrated in the following, further information about such methods can be 
found in textbooks such as fT67ll40ll39l lT69lll7()lll68l . 



18.1 Principal Component Analysis 

Let the connectivity properties of a set of R complex networks, irrespective of their 
type or origin, be described in terms of P scalar measurements Xi, i = 1, 2, . . . , P, 
organized as the feature vector x = (x\,X2, ■ ■ ■ ,xp) T . The covariance matrix K 
can be estimated as 

(x — {x))(x — (x)) T 



K 



R 



(98) 



where (x) is the average feature vector, each element of which corresponds to the 
average of the respective measurement. As K is a real and symmetric PxP matrix, 
a set of P decreasing eigenvalues Aj and respectively associated eigenvectors vl 
can be obtained. Moreover, if all eigenvalues are distinct, the eigenvectors will 
be orthogonal 5 . These eigenvectors can be stacked to obtain the transformation 
matrix T, i.e. 



Vl 

■ v 2 ■ 

Vp 



(99) 



The original feature vectors x can now be transformed into a new coordinates 
reference through the following linear transformation corresponding to axes rota- 
tion: 

X = Tx (100) 

which defines the principal component projections. 

It can be shown 11701 that the distribution of points in the new phase space 
obtained by the above transformation is such that the largest variance is observed 
along the first axis, followed by decreasing variances along the subsequent axes, 
with the initial axes called principal. Such an important property allows, by con- 
sidering only the principal eigenvalues, the original cloud of points to be projected 



" Otherwise, orthogonal eigenvectors can still be assigned to repeated eigenvalues. 
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Figure 21: The principal component projection of the distribution of measurements 
in Figure |20| Measurement values were first normalized by subtracting the corre- 
sponding mean value and dividing by the standard deviation, to avoid distortions 
due to the different absolute values. The first and second PCA variable have pro- 
jecting vectors (-0.005, 0.707, -0.707) and (0.006, 0.707, 0.707) in the space de- 
fined by (r, C,£), respectively. 



along phase spaces of a smaller dimension p. In order to do so, the transforma- 
tion matrix is constructed while taking into account only the first T p eigenvectors 
associated to the largest eigenvalues, i.e. 



vi 
v 2 



(101) 



Figure l^shows the effect of projecting the cloud of points in Figure l20l onto 
the two main axes where the variance of the samples is maximized. Although 
useful for implementing dimensionality reduction — which favors visualization, 
redundancy reduction, and computational savings — the principal component anal- 
ysis method is limited as it does not explicitly consider the category of each indi- 
vidual. This limitation is overcome by the canonical variable analysis described 
below. Note that the obtained projections are little influenced by the measurement 
r, which is compatible with the fact that degree correlations are almost absent in 
B A networks E3I . 



18.2 Canonical Variable Analysis 

The method known as canonical variable analysis provides a powerful extension of 
principal component analysis by performing projections which optimize the sepa- 
ration between the known categories of objects. Before presenting the method, we 
introduce a series of scatter measurements considering each category separately 



57 



and also all categories, from which the overall criterion for class separation is de- 
fined. 

Let us consider that the R complex networks of interest can be divided into N c 
classes, each one with 2V* objects and identified as d, z = 1,2,..., N c , and that 
each object £ is represented by its respective feature vector x\ = (xi, X2, ■ ■ ■ , xp) T 
(see the previous section). The total scatter matrix, S, expressing the overall dis- 
persion of the measurements 1391 is defined as follows 

R rp 

s = J2(fi-&) (4-<*>) • ( l02 ) 

The scatter matrix for each class Cj is given as 

% = E (4 - $0 (4 - ^)i) T > < 103 ) 

where (x)j is the average feature vector of the class Q. 

The intraclass scatter matrix, providing the dispersion inside each of the classes, 
is defined as 

N c 

Sintra = E 5 *- (104) 
i=l 

Finally, the interclass scatter matrix, characterizing the dispersion between 
each pair of classes, is given as 

N C rp 

Sinter = X>< ({X)i ~ (X)) ((X^ ~ (s>) . (105) 
i=l 

It can be verified that 

S = Si n t r a + Winter- (106) 

The objective of the canonical analysis method is to maximize the interclass 
dispersion while minimizing the intraclass scattering (e.g., JT68I). This can be 
achieved through the following linear transformation 

X c = Tx( (107) 

where F = [7I, 72, . . . , '-f p ] T is chosen so that 71 maximizes the ratio 

71 S'intraTl 

and 7^, £ = 2, 3, . . . ,p, maximizes a similar ratio and 

7i T Sintra7l = °" ( 109 ) 
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Figure 22: A phase space (scatterplot) containing two distributions of points (a) and 
respective PCA (b) and canonical (c) projections. Note that neither the projections 
into the three main planes (a) nor the PCA projection (b) separate the distributions, 
in contrast to the canonical projection (c). 

It can be shown that the vectors j{ , 7^, . . . , 7^ correspond to the eigenvectors of 
the matrix Sr^S^ter- 

Figure illustrates a phase space (a) containing two distributions of obser- 
vations, as well as the respective PCA (b) and canonical analysis (c) projections 
considering two dimensions. The potential of the canonical approach for obtaining 
separated clusters is evident from this example. 

18.3 Bayesian Decision Theory 

The elegant and sound methodology known as Bayesian decision theory provides 
an intuitive and effective means for classifying objects into a given set of cate- 
gories. In principle, it is assumed that the mass probabilities Pi, as well as the con- 
ditional probability densities, p(x^\Ci), are all given or can be properly estimated 
(e.g., by using parametric or non-parametric methods, see I3U1 11671 091). The 
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mass probability Pj corresponds to the probability that an object, irrespective of 
its properties, belongs to class d, and therefore can be estimated from the respec- 
tive relative frequency. The conditional probabilities p(x^\Ci) provide a statistical 
model of how the measurements in the feature vectors are distributed inside each 
category. Given an object with unknown classification, the most likely category c 
to be assigned to it is the one for which the respectively observed feature vector x 
produces the highest value of P^p(x\C^). In case the probability functions are not 
available, it is still possible to use an approximate classification method, known 
as k-nearest neighbors (e.g. EUlO . which consists in identifying the set of the k 
individuals which are closer (i.e. smaller distance between feature vectors) to the 
sample to be classified, and take as the resulting category that corresponding to the 
most frequent class among the nearest neighbors. 

Let us illustrate the above concepts and methodology in terms of a situation in- 
volving three categories C\, C2 and C3 of complex networks, namely Geographical 
Network (GN), Watts-Strogatz small-world network (WS) and Erdos and Renyi 
random graph (ER), characterized in terms of their normalized average shortest 
path length I and Pearson correlation coefficient of vertex degrees r. The corre- 
sponding scatterplot is shown in Figure a). 

Usually we do not know the mass and conditional probabilities of each type 
of networks, so they have to be estimated from the available data. This stage 
can be understood as the training phase of the Bayesian decision theory method. 
There are two main ways to estimate the probabilities required: parametric and 
non-parametric. In the former, the mathematical form of the probability functions 
is known (e.g., normal distribution) and the respective parameters (mean and co- 
variance matrix, in the case of normal distributions) need to be estimated; in the 
latter, the mathematical type of the densities is unknown , being estimated, e.g., 
through some interpolation procedure such as the Parzen windows methodology 
BUI . Once the training phase is concluded, new objects whose classes are to be 
determined have their measurements estimated and used to identify, among the 
probability distributions of the trained models, which are the most likely respec- 
tive classes. This categorization procedure corresponds to the decision phase of 
the Bayesian methodology. 

Figure l23f b) illustrates the parametric approach, considering three normal den- 
sity distributions, as applied to the data in Figure I23T a). These distributions were 
defined by having their parameters (namely average vector and covariance matrix) 
estimated from the respective experimental measurements. The separating fron- 
tiers are shown in the projection at the bottom of the figure. The decision regions 
obtained by using non-parametric estimation through Parzen windows is shown in 
Figure l23l c). 

Note that a very high dimensional feature space implies that a substantially high 
number of individuals must be considered in order to obtain properly estimated (i.e. 
not too sparse) densities. Therefore, it is essential to limit the number of measure- 
ments to a small set of more discriminative features. An interesting alternative 
involves the use of canonical projections in order to reduce the dimensionality of 
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Figure 23: A scatterplot of normalized measurements containing several complex 
networks derived from three main categories, i.e. Geographical Network (GN), 
Watts-Strogatz (WS) and Erdos and Renyi random graph (ER) models (a), and re- 
spective Gaussians and decision regions obtained by using the Bayes method con- 
sidering parametric (b) and non-parametric (c) estimation. Network parameters are 
./V = 250, (k) = 20, with 1000 realizations for each model; rewiring probability in 
the WS model is 0.4. 
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the problem. A key open question which is briefly addressed in this section regards 
which of the several topological measurements available for complex networks 
characterization can yield the best characterization and discrimination among the 
principal network models. 

18.4 Combining Canonical Variable Analysis and Bayesian Decision 
Theory 

An interesting possibility for classifying networks involves the combination of 
canonical variable analysis and Bayesian decision theory (e.g., f^l 11681 11711 1. 
More specifically, the observations considered for the training stage are projected 
into a reduced dimensional feature space by using canonical analysis, so that the 
Bayesian decision method is applied not over the larger original features space, 
but onto a more manageable and representative features space. This possibility is 
explored in this section to address the important issue of classifying experimen- 
tal complex networks into three main categories defined by the similarity with the 
Barabasi- Albert (BA), Erdos and Renyi random graph (ER) and Geographical Net- 
work (GN) models. The following experimental networks are considered in our 
experiments: 

US Airlines Transportation Network (USATN): The USATN is composed by 332 
airports, localized in the United States in 1997, connected by flights. The data was 
collected from the Pajek datasets HI 721 . This kind of network exhibits a power law 
behavior as described in IU73ll62l . 

Protein-Protein Interaction Network of the Saccharomyces Cerevisiae (PPIN): PPIN 
is formed by 1922 proteins linked according to identified direct physical interac- 
tions 11741 . a dataset is available at the Center for Complex Network Research (The 
University of Notre Dame). The vertex degree distributions of protein-interaction 
networks tend to follow a power law 11741 . 

Autonomous System (AS): In the Internet, an AS is a collection of IP networks 
and routers under the control of one entity that presents a common routing policy 
to the Internet. Each AS is a large domain of IP addresses that usually belong 
to one organization such as a university, a business enterprise, or an Internet Ser- 
vice Provider. In this type of networks, two vertices (AS) are connected if there 
is at least one physical link between them. This kind of network is usually de- 
scribed by the Barabasi-Albert model 11751 and the data considered in our work 
is available at the web site of the National Laboratory of Applied Network Re- 
search (http : / /www . nlanr . net). We used the data collected in Feb. 1998, 
with the network containing 3522 vertices and 6324 edges. 

Transcriptional Regulation Network of the E. coli (TRNE): In this network, the 
vertices represent operons (an operon is a group of contiguous genes that are 
transcribed into a single mRNA molecule) and each edge is directed from an 
operon that encodes a transcription factor to another operon that is regulated by 
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that transcription factor. Hence, this kind of network, which is believed to be scale 
free 11471 . controls gene expression. We used the undirected version of the net- 
work analyzed by Shen-Orr et al. I147L which is formed by 577 interactions and 
424 operons. The original network was transformed into the undirected form by 
the operation of symmetry as described in Section |2 

Delaunay Network (DLN): This network was obtained by distributing a set of 
points (the vertices) uniformly (but with an exclusion radius in order to avoid points 
to become too close) along a unit square and obtaining the edges from the respec- 
tive Delaunay triangulation (e.g. 117610 . Therefore, each point defines a tile in 
the respective Voronoi diagram, and every pair of adjacent vertices are connected 
(see Figure l24l. The connectivity of this type of geometrical structure, henceforth 
called Delaunay network, is therefore completely determined by the adjacency be- 
tween the vertices, which is in turn defined by the geographical distribution of the 
vertices. As such, Voronoi networks provide one interesting extreme case of geo- 
graphical networks where only the immediate spatial neighborhood is considered 
for connection. The network considered here contains 251 vertices and 700 edges. 
Progressively rewired (degree preserving) versions of this network are also consid- 
ered in order to illustrate the evolution of trajectories in decision spaces. Figure l24l 
illustrates four of these successive configurations. 

A total of three sets of 300 realization of each reference model (BA, ER and 
GN) were then generated. The networks for each set were designed to have average 
vertex degrees near the experimental value. The model and experimental networks 
were characterized in terms of the following measurements: straightness st, aver- 
age vertex degree (k), Pearson correlation coefficient of vertex degrees r, average 
clustering coefficient C, average shortest path length I, central point dominance 
CPD, average hierarchical degree of second level {k 2 {i)), average hierarchical 
clustering coefficient of second level (62(2)) and average hierarchical divergence 
ratio of the third level (dv3(i)). 

In order to provide a general and representative view of the effect of these mea- 
surements in the classification of real networks by using the methodology involving 
the canonical analysis followed by Bayesian decision, we considered the following 
combinations of measurements: 

i. {£,st}, 

ii. {{k),C,£}, 

hi. {(k 2 (i)),(C 2 (i)),{dv 3 (i))}, 

iv. {st,r,CPD}, 

v. {(k), C,£, st, r, CPD}, 

vi. {{k),C,£,(k 2 (i)),(C 2 (i)),{dv 3 (i))}, 
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(a) (b) 




(c) (d) 



Figure 24: The Delaunay geographical network (DLN) for several numbers of 
rewirings: original (a) and after 60 (b), 120 (c) and 200 (d) rewirings. 
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vii. {st,r, CPD, (k 2 (i)}, (C 2 (i)>, («M0». 

viii. all measurements. 

The combination (i) was the only one that did not require canonical analysis. Ta- 
ble |4] shows the results, i.e. the theoretical model and respective average vertex 
degree which have been associated to each experimental network by the classifica- 
tion procedure, obtained for each of these configurations. More specifically, each 
experimental network was classified as having the same category as the theoreti- 
cal model defining the decision region in the canonical projection space where the 
feature vector of the experimental data was mapped. 

A number of interesting facts can be inferred from Table |4] To begin with, 
the compatibility between the type of network model expected and obtained for 
each of the experimental networks varies considerably for each case. The best 
compatibility was obtained for the DLN, i.e. the identified model was compatible 
with the expected type (geographical) for all considered combinations of measure- 
ments. Compatible average vertex degrees have also been obtained for cases (iii), 
(vi)-(viii). Figure |25] illustrates the location of this network in the scatterplot de- 
fined by the canonical projection of the combination of all measurements. In this 
figure, which also shows the separating frontiers of the decision regions, the ex- 
perimental network DLN (represented as o) resulted closer to GN with average 
vertex degree of 6. PPIN implied the highest number of incompatible classifica- 
tions which, instead of being identified as a scale-free network (as expected 11741 ). 
was understood as GN except for the cases {£, st} and {st, r, CPD}. A similar sit- 
uation was verified regarding the average vertex degrees. Figure l26T c) and (d) show 
the resulting position of this network within the scatterplots obtained by canonical 
projection of the combination of all measurements (c) and all except those hierar- 
chical (d). Note the good agreement between the resulting categories obtained for 
these two cases. In both cases, the PPIN resulted very close to the GN with average 
vertex degree of 3.03. 

A particularly interesting result has been obtained for the USATN, which tended 
to appear well away from all theoretical groups in most cases, as illustrated in the 
scatterplot shown in Figure I26T a) with respect to the case {(k), C, £}. Intermediate 
results were obtained for the other networks. For instance, TRNE has been classi- 
fied as expected (i.e. as a BA network) in 2 cases, identified as an ER in only one 
case and as a GN in 5 cases. Figure l26T b) shows the position of this network in the 
scatterplot defined for all measurements. Note that TRNE appears almost in the 
middle of the ER and GN types for average vertex degree of 2.45. 

It is also possible to use hierarchical clustering algorithms (e.g. RUl l39l 17581 ) 
in order obtain additional information about the relationship between the analyzed 
networks. Figure ^2j\ shows the dendrogram obtained for the situation depicted 
in Figure l26T c) by using Ward's aglomerative method. In this method the net- 
works, inititally treated as individual clusters, are progressively merged in order to 
guarantee minimal dispersion inside each cluster. The linkage distance is shown 
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Table 4: The classes assigned to the real networks by considering each combination of measurements. The classes in bold mean wrong 
identified model and, in italic style, wrong average vertex degree. The * symbol represents an identified class well away from all 
theoretical models (see, for instance, Figure l26T c)). 
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Delaunay 
Network (DLN) 
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GN 
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GN 
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GN 
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GN 
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Figure 25: Separating frontiers between the decision regions in the scatterplots 
obtained by canonical analysis for the DLN. The separating frontiers were obtained 
by Bayesian decision theory. Note the trajectory defined by the mapping of the 
progressively rewired versions of the original DLN network, extending from the 
GN towards the ER region with (k) = 6. 



67 



1.0 



to 0.0 



-1.0 



-2.0 



-3.0 



-4.0 




ISA. ,k 10 

BA, <k)= 12 
BA, <k) = 14 
ER, <k> = 10 
ER, <k>=12.8 
ER, <k) = 14 
GN, (k) = 10 
GN, <k> = 12.8 
GN, <k) = 14 
USATN, (k) = 12.8 



-1.0 -0.5 0.0 

first canonical variable 



0.5 



1.0 



> 

1 1.0 



CO 

o 

"D 



-1.0 



-2.0 



-2.0 



-1.0 



BA, (k> = 2 
BA, (k> = 4 
..•■BA, <k> = 6 

/ER, <k> = 2.45 
ER, (k> = 4 
ER, (k> = 6 
GN, <k> = 2.45 
GN, <k> = 4 
GN, W=6 

TRNE,.(kJ'=2.45 



0.0 1.0 

first canonical variable 



2.0 



3.0 



(a) 



(b) 



OS 



2.0 



0.0 



BA, <k> = 2 


+ 


BA, (k> = 4 




BA, <k> = 6, 




ER, <k>*'"2 


□ 


ER<"(k> = 3 




,.-E"R, <k> = 6 




" GN, (k> = 2 


■ 


GN, <k> = 3 


• 


GN, <k> = 6 




PPIN, (k> = 3.03 




/ \ 








M 


\ 

> 




-0.5 0.0 


0.5 




t 



\ 



2.5 - 

2.0 - \ 

1 _ \ ^ 

1.0 - ~ 

0.5 - 

0.0 - 

-0.5 - 

-1.0 - 
-1.5 



BA, <k> = 2 
BA, (te = 4 
BA, <k>V 6 
ER, <k> \ 2 □ 
ER, (k) 4 3 
ER, (k> = 
GN, (k> = 
GN, (k> = 3 
GN, <k> = 6 
PPIN, <k> = 3.03 




first canonical variable 



first canonical variable 



(c) 



(d) 



Figure 26: Examples of classification by canonical variable analysis and Bayesian decision theory: (a) US Airlines Transportation 
Network (USATN); (b) the Transcriptional Regulation Network of the E. coli (TRNE); and (c) the Protein-Protein Interaction Network of 
the Saccharomyces Cerevisiae (PPIN), considering all measurements; (d) the same protein network as in (c) but excluding the hierarchical 
measurements. Note the presence of the separating frontiers between the decision regions in the scatterplots. The arrows indicate the 
experimental networks. 
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GN (6) GN (3) ER (6) ER (2) BA (4) 

PPIN (3.03) GN (2) ER (3) BA (6) BA (2) 



Figure 27: Dendrogram obtained for the protein-protein interaction network con- 
sidering all measurements except those hierarchical. Note that the BA, ER and GN 
networks resulted in well-separated branches, while the protein-protein network 
was included into the latter group. 



along the y— axis, indicating the point where the clusters are merged (the sooner 
two clusters are merged, the most similar they are). The similarity between the 
cases belonging to each of the three types of networks is reflected by the fact that 
three respective main branches are obtained in the dendrogram in Figure ^2j\ The 
GN cluster incorporates the experimental protein-protein network, to which it is 
most closely related by the measurements. Note that the GN group, including the 
protein-protein network, is significantly different from the ER and BA models at 
the right-hand side of the figure, as indicated by the high linkage distance at which 
these two groups (i.e. the GN and ER/BA) are merged. 

The results discussed above illustrate the classification procedure and its po- 
tential for identifying the category of networks of unknown nature. The fact that 
the assigned category sometimes varies according to the choice of measurements 
suggests the presence of specific topological features in some experimental net- 
works which are not fully compatible with any of the assumed theoretical models. 
Indeed, the consideration of more measurements can, in principle, provide a more 
comprehensive subclassification of the networks. Such a possibility is particularly 
important in the case of scale-free networks, which are known to involve subtypes 
B31 . For instance, TRNE has been identified in our experiments as having BA 
type while considering two measurements (i.e. {st, £}), but was understood as a 
GN model by considering three measurements (i.e. {{k),C,£}) and as ER when 
we considered six measurements (i.e. {(k), C, I, st, r, CPD}). 
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It should be always kept in mind that the consideration of an excessive num- 
ber of measurements may ultimately compromise the quality of the classification. 
Methodologies such as the canonical analysis followed by Bayesian classification 
can be used to identify the features which contribute the most for the correct clas- 
sifications. This can be done by considering the measurements which contribute 
more intensely for the canonical projections providing the largest number of cor- 
rect classifications. A simpler methodology involves the application of the princi- 
pal component analysis to remove the redundancies between the measurements. In 
the case of a reduced number of measurements it is also possible to consider all the 
respective combinations and evaluate which of them yields the best classifications. 
Another interesting possibility for investigating complex network connectivity is 
to consider outliers analysis (e.g. II166I0 . The reader interested in additional infor- 
mation on multivariate statistics and feature selection is referred to the specialized 
literature (e.g., ll40ll39l[T68lll67l ') for more in-depth discussion and coverage. 

19 Concluding Remarks 

Measurements of the connectivity and topology of complex networks are essen- 
tial for the characterization, analysis, classification, modeling and validation of 
complex networks. Although initially limited to simple features such as vertex de- 
gree, clustering coefficient and shortest path length, several novel, powerful mea- 
surements have been proposed. We hope it has been made clear that the several 
available measurements often provide complementary characterization of distinct 
connectivity properties of the structures under analysis. It is only by becoming 
familiar with such measurements that one can expect to identify proper sets of fea- 
tures to be used for the characterization of complex networks. The current survey 
has been organized to provide a comprehensive coverage of not only the most tra- 
ditional measurements but also complementary alternatives which, though not so 
frequently used, can provide valuable resources for characterizing specific topo- 
logical properties of complex networks. Special attention was also given to the 
application of measurements in community finding algorithms, an important issue 
in complex network research. 

In addition to presenting such measurements according to coherent categories, 
we also addressed issues such as visualization, in terms of trajectories defined by 
measurements, of complex network growth. As illustrated by the results presented, 
which considered several important theoretical network models, such trajectories 
clearly reflect, in graphical terms, important tendencies exhibited by different net- 
work categories as their average degree is increased. Another important point to be 
kept in mind in network measurements is correlations. While high correlation be- 
tween a pair of measurements indicates that they are largely redundant, our results 
show that the own correlation values vary from one network model to another, pro- 
viding further useful information for network characterization. Another important 
property of a specific measurement is its sensitivity to small perturbations in the 
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network, such as the inclusion or removal of edges or vertices. We illustrated that 
different measurements can behave very differently with respect to such induced 
changes. Because one of the most challenging issues related to network catego- 
rization regards the choice of the features to be taken into account, we provided a 
self-contained discussion about how multivariate statistics concepts and methods 
can be applied for that aim. More specifically, we showed how high dimensional 
measurement spaces can be effectively projected, by using principal component 
analysis, into lower-dimensional spaces favoring visualization and application of 
computationally intensive measurements. We also described how two useful meth- 
ods, namely canonical analysis and Bayesian decision theory, can be combined to 
provide the means for semi-automated identification of the effective linear com- 
binations of measurements, in the sense of allowing good discrimination between 
network categories. The potential of such multivariate methodologies was illus- 
trated for theoretical models and experimental networks. The results clearly sug- 
gested that considering a comprehensive set of measurements can provide more 
complete characterization of the topological properties of the networks to the point 
of requiring a revision of the traditional classification of experimental networks 
into subclasses or new models. 

All in all, this survey provides for the first time an integrated presentation and 
discussion of a comprehensive set of measurements previously covered in sepa- 
rated works. In addition, it addresses important issues related to application of 
these measurements for characterization and classification of networks, including 
dynamical representations in terms of trajectories, redundancy between measure- 
ments as quantified by correlations, perturbation effects and a powerful multivari- 
ate framework for classification of networks of unknown category. The systematic 
application of such concepts and tools is poised to yield a wealthy of new results 
in the study of complex networks. 
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Table 5: Summary of discussed measurements. 



Measurement 


Symbol 


Equation 


Mean geodesic distance 


t 


o 


Global efficiency 


E 


d 


Harmonic mean distance 


h 




Vulnerability 


V 




Network clustering coefficient 


CandC 


(EU and El 


Weighted clustering coefficient 




El 


Cyclic coefficient 


e 




Maximum degree 


k 

"rnax 


El 


Mean degree of the neighbors 




El 


Degree-degree correlation coefficient 


r 




Assortativity coefficient 


Q, Q 


d42li and gl 


Bipartivity degree 


b and 


(El and (El 


Degree Distribution entropy 


H(i) 


El 


Average search information 


S 


(ED 


Access information 


A, 


El 


Hide information 


Hi 


El 


Target entropy 


T 


El 


Road entropy 


K 


El 


Betweenness centrality 


Bi 


El 


Central point dominance 


CPD 


El 


Zth moment 


M x 


(ED 


Modularity 


Q 


(H 


Participation coefficient 


Pi 


El 


z-score 




El 


Significance profile 


SPi 


G3 


Subgraph centrality 


sc 


ii 


Hierarchical clustering coefficient 


C rs 


El 


Convergence ratio 


cv d {i) 
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